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[Title of Document] CLAIMS 
[Claim 1] 

A replicon RNA, comprising a nucleotide sequence containing at least the 5' 
untranslated region, the nucleotide sequence encoding NS3 protein, NS4A protein, 
NS4B protein, NS5A protein and NS5B protein and the 3' untranslated region on 
the genomic RNA of hepatitis C virus of genotype 2a. 
[Claim 2] 

The replicon RNA of claim 1, containing at least one selection marker gene or a 
reporter gene, and at least one IRES sequence. 
[Claim 3] 

A replicon RNA, comprising a nucleotide sequence containing the 5' untranslated 
region comprising the nucleotide sequence represented by SEQ ID NO: 9 or 10; at 
least one selection marker gene or a reporter gene; an IRES sequence; the 
nucleotide sequence encoding NS3 protein, NS4A protein, NS4B protein, NS5A 
protein and NS5B protein on the genomic RNA of hepatitis C virus of genotype 
2a; and the 3' untranslated region comprising the nucleotide sequence represented 
by SEQ ID NO: 11 or 12. 
[Claim 4] 

The replicon RNA of any one of claims 1 to 3, wherein the genomic RNA of 
hepatitis C virus of genotype 2a is an RNA comprising the nucleotide sequence 
represented by SEQ ID NO: 3 or 5. 
[Claim 5] 

A replicon RNA, comprising the following RNA (a) or (b): 

(a) an RNA comprising the nucleotide sequence represented by SEQ ID NO: 1 or 
2; and 

(b) an RNA comprising a nucleotide sequence derived from the nucleotide 
sequence represented by SEQ ID NO: 1 or 2 by deletion, substitution or addition 
of 1 to 10 nucleotides, and being capable of autonomous replication. 

[Claim 6] 
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A replicon-replicating cell, which is prepared by introducing the rep 1 icon RNA of 
any one of claims 1 to 5 into a cell. 
[Claim 7] 

The replicon-replicating cell of claim 6, wherein the cell is a eukaryotic cell. 
[Claim 8] 

The replicon-replicating cell of claim 7, wherein the eukaryotic cell is a human 
liver-derived cell, a human uterine cervix-derived cell or a human fetal kidney- 
derived cell. 
[Claim 9] 

The replicon-replicating cell of claim 7, wherein the eukaryotic cell is any one cell 
selected from the group consisting of an Huh7 cell, an HepG2 cell, an IMY-N9 cell, 
an HeLa cell and a 293 cell. 
[Claim 10] 

The replicon RNA of any one of claims 1 to 5, which is for producing or 
evaluating a therapeutic agent or a diagnostic agent against hepatitis C virus 
infection. 
[Claim 11] 

The replicon-replicating cell of any one of claims 6 to 9, which is for producing or 
evaluating a therapeutic agent or a diagnostic agent against hepatitis C virus 
infection. 
[Claim 12] 

The replicon RNA of any one of claims 1 to 5, which is for producing a vaccine 
against hepatitis C virus infection. 
[Claim 13] 

The replicon-replicating cell of any one of claims 6 to 9, which is for producing a 
vaccine against hepatitis C virus infection. 
[Claim 14] 

A method of producing a replicon RNA of hepatitis C virus of genotype 2a, 
comprising extracting the replicon RNA from the replicon-replicating cell of any 
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one of claims 6 to 9. 
[Claim 15] 

A method of producing a viral protein of hepatitis C virus of genotype 2a, 
comprising culturing the replicon-replicating cell of any one of claims 6 to 9, and 
obtaining the viral protein from the resulting culture product. 
[Claim 16] 

A method of screening for a substance promoting or suppressing the replication of 
hepatitis C virus, comprising culturing the replicon-replicating cell of any one of 
claims 6 to 9 in the presence of a test substance, and detecting the replication of a 
replicon RNA in the resulting culture product. 
[Claim 17] 

A method of increasing the replication efficiency of the replicon RNA of hepatitis 
C virus of genotype 2a, comprising performing once or more the following: 
obtaining a replicated replicon RNA from the replicon-replicating cell of any one 
of claims 6 to 9, and introducing the thus obtained replicated replicon RNA into a 
cell that is different from the replicon-replicating ceil so as to prepare a new 
replicon-replicating cell. 
[Claim 18] 

The method of claim 17, wherein the replication efficiency increases to become at 
least two times greater than that of the replicon RNA that is introduced at the 
beginning into the replicon-replicating cell. 
[Claim 19] 

A method of producing a replicon RNA of hepatitis C virus of genotype 2a having 
increased replication efficiency, comprising performing once or more the 
following: obtaining a replicated replicon RNA from the replicon-replicating cell 
of any one of claims 6 to 9, and introducing the thus obtained replicated replicon 
RNA into a cell that is different from the replicon-replicating cell so as to prepare 
a new replicon-replicating cell; and obtaining a replicated replicon RNA from the 
finally obtained replicon-replicating cell. 
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[Claim 20] 

A method of producing a replicon RNA of hepatitis C virus of genotype 2a having 
increased replication efficiency, comprising detecting a nucleotide mutation or an 
amino acid mutation between the replicon RNA that is produced so as to have an 
increased replication efficiency by the method of claim 19 and the replicon RNA 
that is introduced at the beginning into the replicon-replicating cell; and 
introducing the thus detected nucleotide mutation or amino acid mutation into a 
replicon RNA whose replication efficiency is to be increased. 
[Claim 21] 

A replicon RNA, comprising a nucleotide sequence derived from the nucleotide 
sequence represented by SEQ ID NO: 1 by at least one mutation selected from the 
group consisting of the following (a) to (u): 

(a) a mutation from A to G at nucleotide site 7157; 

(b) a mutation from C to U at nucleotide site 4955; 

(c) a mutation from A to G at nucleotide site 4936; 

(d) a mutation from A to G at nucleotide site 5000; 

(e) a mutation from A to G at nucleotide site 7288; 

(f) a mutation from G to U at nucleotide site 5901; 

(g) a mutation from A to U at nucleotide site 61 13; 

(h) a mutation from A to G at nucleotide site 2890; 

(i) a mutation from C to A at nucleotide site 6826; 
(j) a mutation from C to A at nucleotide site 6887; 
(k) a mutation from U to A at nucleotide site 6580; 
(1) a mutation from U to C at nucleotide site 7159; 
(m) a mutation from U to A at nucleotide site 7230; 
(n) a mutation from C to A at nucleotide site 6943; 
(o) a mutation from G to A at nucleotide site 5687; 
(p) a mutation from A to G at nucleotide site 6110; 
(q) a mutation from U to C at nucleotide site 5550; 
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(r) a mutation from A to G at nucleotide site 7217; 
(s) a mutation from A to G at nucleotide site 3643; 
(t) a mutation from G to A at nucleotide site 5851 ; and 
(u) a mutation from G to A at nucleotide site 5914. 
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[Title of Document] DESCRIPTION 

[Title Of Invention] A NUCLEIC ACID CONSTRUCT CONTAINING A NUCLEIC 
ACID DERIVED FROM THE GENOME OF HEPATITIS C VIRUS (HCV) OF 
GENOTYPE 2a, AND A CELL HAVING SUCH NUCLEIC ACID CONSTRUCT 
INTRODUCED THEREIN 
[Technical Field] 
[0001] 

The present invention relates to a replicon RNA of the hepatitis C virus of 
genotype 2a, a replicon-replicating cell wherein the replicon RNA is introduced, 
and a method of increasing the replication efficiency of the replicon RNA. 
[Background Art] 
[0002] 

The hepatitis C virus (HCV) is a virus belonging to the family 
Flaviviridae. It has a single-stranded (+) strand sense RNA as its genome and is 
known to cause hepatitis C. Recent studies have revealed that Hepatitis C virus 
is classified into a number of types based on genotypes or serotypes. According 
to the phylogenetic analysis of Simmonds et al., using the nucleotide sequences of 
the HCV strains, which is currently a mainstream method of classifying HCV 
genotypes, FICV is classified into 6 genotypes: genotype la, genotype lb, 
genotype 2a, genotype 2b, genotype 3a and genotype 3b (see Non Patent Literature 
1). Each of these types is further classified into several subtypes. The 
nucleotide sequences of the full-length genomes of a several number of genotypes 
of HCV have been determined to date (see Patent Literature 1 and Non Patent 
Literatures 2-5). 
[0003] 

HCV causes chronic hepatitis by persistent infection. Currently, the main 
cause of chronic hepatitis observed worldwide is persistent HCV infection. 
Actually, around 50% of individuals with persistent infection develop chronic 
hepatitis. Chronic hepatitis in approximately 20% of these patients shifts to liver 
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cirrhosis over the course of 10 to 20 years, and some of these patients further go 

on to advanced lethal pathological conditions such as hepatic cancer. 

[0004] 

Hepatitis C is currently treated mainly by a therapy using interferon-a or 
interferon- p\ or a therapy using in combination interferon-a and ribavirin, the 
purine-nucleoside derivative. However, even when these therapies are 
performed, the therapeutic effects are observed in only approximately 60% of all 
the treated patients. When the therapies are ceased after the exertion of the 
effects, the disease recrudesces in more than half of the patients. The therapeutic 
effect of interferones is known to relate to HCV genotypes, and is said to be lower 
against genotype 1 b and higher against genotype 2a (see Non Patent Literature 6). 
[0005] 

It is an important goal to develop therapeutic agents or prophylactic agents 
effective against hepatitis C, the incidence rate of which is high in industrial 
countries, for which currently no causal treatment are present, and which finally 
bring about serious results. Hence, the development of HCV-specific 
chemotherapies and vaccine therapies are earnestly desired. A target for the 
development of an anti-HCV agent may be the suppression of HCV replication or 
the suppression of infection of cells with HCV. 
[0006] 

Until recently, propagation of HCV in a cell culture system and infecting 
cultured ceils with HCV have been difficult. Moreover, a chimpanzee has been 
the only animal that can be infected with HCV and can be used in experiments, so 
that it has been difficult to carry out studies on the replication mechanism of HCV 
and the infection mechanism of HCV. However, recently, HCV subgenomic RNA 
rep I icons have been prepared as HCV-derived autonomously replicable RNA (see 
Patent Literature 2 and Non Patent Literatures 7-10), which enables the analysis of 
the replication mechanism of HCV using cultured cells. These HCV subgenomic 
RNA repiicons are each prepared by substituting structural proteins existing 
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downstream of HCV IRES in the 5* untranslated region of the HCV genomic RNA 
of genotype lb with a neomycin resistance gene and EMCV IRES that has been 
ligated downstream of the resistance gene. It has been demonstrated that this 
RNA replicon is autonomously replicated in human hepatic cancer cells, Huh7 
cells, when introduced into the Huh7 cells followed by culture in the presence of 
neomycin. 
[0007] 

However, regarding such intracellular RNA replication systems for HCV, 
only those using HCV genomic RNA of genotype lb have been prepared so far. 
Since there has been a report that different genotypes of HCV differ also in viral 
proteins encoded, it may be difficult to sufficiently elucidate the replication 
mechanism of HCV only by analyzing the subgenomic RNA replicons derived 
from HCV of genotype lb. Furthermore, based on the fact that the therapeutic 
effects of interferons differ depending on the HCV genotypes, it may be 
particularly difficult to develop an anti-HCV agent having an effect on various 
types of HCV by the use of only an HCV replication system containing the 
subgenomic RNA replicon of HCV of genotype lb. 
[0008] 

[Patent Literature 1] JP Patent Publication (Kokai) No. 2002-171978 A 
[Patent Literature 2] JP Patent Publication (Kokai) No, 2001-17187 A 
[Non Patent Literature 1] Simmonds, P. et al, Hepatology, (1994) 10, pp. 1321- 
1324 

[Non Patent Literature 2] Choo et al., Science, (1989) 244, pp. 359-362 
[Non Patent Literature 3] Kato et al., J. Med. Virol, (2001) 64(3) pp. 334-339 
[Non Patent Literature 4] Okamoto, H et al, J. Gen. Virol., (1992) 73 pp. 673-679 
[Non Patent Literature 5] Mori, S. et al, Biochem. Biophis. Res. Comraun., 
(1992) 183, pp. 334-342 
[Non Patent Literature 6] Yoshioka et al., Hepatology, (1992) 16(2): pp. 293-299 
[Non Patent Literature 7] Lohmann et al., Science, (1999) 285, pp. 110-113 
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[Non Patent Literature 8] Blight et al., Science, (2000) 290, pp. 1972-1974 
[Non Patent Literature 9] Friebe et al., J. Virol., (2001) 75(24): pp. 12047-12057 
[Non Patent Literature 10] Ikeda et al., J. Virol., (2002) 76(6): pp. 2997-3006 

[Disclosure of Invention] 

[Problem to be Solved by Invention] 

[0009] 

An object of the present invention is to provide an HCV-derived replicon 
RNA of a HCV genotype for which replicon RNA has not yet been prepared. 
[Means for Solving the Problem] 
[0010] 

As a result of intensive studies to achieve the above object, we have 
succeeded in preparing the replicon RNA of HCV genotype 2a. 
[0011] 

That is, the present invention is as follows. 
[1] A replicon RNA, comprising a nucleotide sequence containing at least the 5' 
untranslated region, the nucleotide sequence encoding NS3 protein, NS4A protein, 
NS4B protein, NS5A protein and NS5B protein and the 3' untranslated region on 
the genomic RNA of hepatitis C virus of genotype 2a. Preferably, this replicon 
RNA further contains at least one selection marker gene or a reporter gene, and at 
least one IRES sequence. 

[2] A replicon RNA, comprising a nucleotide sequence containing the 5' 
untranslated region comprising the nucleotide sequence represented by either SEQ 
ID NO: 9 or 10; at least one selection marker gene or a reporter gene; an IRES 
sequence; the nucleotide sequence encoding NS3 protein, NS4A protein, NS4B 
protein, NS5A protein and NS5B protein on the genomic RNA of hepatitis C virus 
of genotype 2a; and the 3' untranslated region comprising the nucleotide sequence 
represented by either SEQ ID NO: 11 or 12. 

[3] The replicon RNA of [1] or [2] above, wherein the genomic RNA of hepatitis C 
virus of genotype 2a is an RNA comprising the nucleotide sequence represented by 
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SEQ ID NO: 3 or 5. 

[4] A repiicon RNA, comprising the following RNA (a) or (b): 

(a) an RNA comprising the nucleotide sequence represented by SEQ ID NO: 1 or 
2; and 

(b) an RNA comprising a nucleotide sequence derived from the nucleotide 
sequence represented by SEQ ID NO: 1 or 2 by deletion, substitution or addition 
of 1 to 10 nucleotides, and being capable of autonomous replication. 

[5] A replicon-replicating cell, which is prepared by introducing the repiicon RNA 
of any one of [I] to [4] above into a cell. For this replicon-replicating ceil, a cell 
into which the repiicon RNA is introduced is preferably a eukaryotic cell, more 
preferably a human liver-derived cell, a human uterine cervix-derived cell or a 
human fetal kidney-derived cell, and further more preferably any one cell selected 
from the group consisting of an Huh7 cell, an HepG2 cell, an IMY-N9 cell, an 
HeLa cell and a 293 cell. 

[6] The repiicon RNA of [1] to [4] above, which is for producing or evaluating a 

therapeutic agent or a diagnostic agent against hepatitis C virus infection, 

[7] The replicon-replicating cell of [5] above, which is for producing or evaluating 

a therapeutic agent or a diagnostic agent against hepatitis C virus infection. 

[8] The repiicon RNA of [1] to [4] above, which is for producing a vaccine against 

hepatitis C virus infection. 

[9] The replicon-replicating cell of [5] above, which is for producing a vaccine 
against hepatitis C virus infection. 

[10] A method of producing a repiicon RNA of hepatitis C virus of genotype 2a, 
comprising extracting the repiicon RNA from the replicon-replicating cell of [5] 
above. 

[11] A method of producing a viral protein of hepatitis C virus of genotype 2a, 
comprising culturing the replicon-replicating cell of [5] above, and obtaining the 
viral protein from the resulting culture product. 

[12] A method of screening for a substance promoting or suppressing the 
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replication of hepatitis C virus, comprising culturing the replicon-replicating cell 
of [5] above in the presence of a test substance, and detecting the replication of a 
replicon RNA in the resulting culture product. 

[13] A method of increasing the replication efficiency of the replicon RNA of 
hepatitis C virus of genotype 2a, comprising performing once or more the 
following: obtaining a replicated replicon RNA from the replicon-replicating cell 
of [5] above, and introducing the thus obtained replicated replicon RNA into a cell 
that is different from the replicon-replicating cell so as to prepare a new replicon- 
replicating cell. In this method, it is more preferred that the replication 
efficiency increases to become preferably at least two times greater than that of 
the replicon RNA that is introduced at the beginning into the replicon-replicating 
cell. 

[14] A method of producing a replicon RNA of hepatitis C virus of genotype 2a 
having increased replication efficiency, comprising performing once or more the 
following: obtaining a replicated replicon RNA from the replicon-replicating cell 
of [5] above, and introducing the thus obtained replicated replicon RNA into a cell 
that is different from the replicon-replicating cell so as to prepare a new replicon- 
replicating cell; and obtaining a replicated replicon RNA from the finally obtained 
replicon-replicating cell. 

[15] A method of producing a replicon RNA of hepatitis C virus of genotype 2a 
having increased replication efficiency, comprising detecting a nucleotide 
mutation or an amino acid mutation between the replicon RNA that is produced so 
as to have an increased replication efficiency by the method of [14] above and the 
replicon RNA that is introduced at the beginning into the replicon-replicating cell; 
and introducing the thus detected nucleotide mutation or amino acid mutation into 
a replicon RNA whose replication efficiency is to be increased. 
[16] A replicon RNA, comprising a nucleotide sequence derived from the 
nucleotide sequence represented by SEQ ID NO: 1 by at least one mutation 
selected from the group consisting of the following (a) to (u): 
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(a) a mutation from A to G at nucleotide site 7157; 

(b) a mutation from C to U at nucleotide site 4955; 

(c) a mutation from A to G at nucleotide site 4936; 

(d) a mutation from A to G at nucleotide site 5000; 

(e) a mutation from A to G at nucleotide site 7288; 

(f) a mutation from G to U at nucleotide site 5901; 

(g) a mutation from A to U at nucleotide site 6113; 

(h) a mutation from A to G at nucleotide site 2890; 

(i) a mutation from C to A at nucleotide site 6826; 
(j) a mutation from C to A at nucleotide site 6887; 
(k) a mutation from U to A at nucleotide site 6580; 
(1) a mutation from U to C at nucleotide site 7159; 
(m) a mutation from U to A at nucleotide site 7230; 
(n) a mutation from C to A at nucleotide site 6943; 
(o) a mutation from G to A at nucleotide site 5687; 
(p) a mutation from A to G at nucleotide site 6110; 
(q) a mutation from U to C at nucleotide site 5550; 
(r) a mutation from A to G at nucleotide site 7217; 
(s) a mutation from A to G at nucleotide site 3643; 

(t) a mutation from G to A at nucleotide site 5851; and 
(u) a mutation from G to A at nucleotide site 5914. 
[Effects of Invention] 
[0012] 

According to the present invention, an HCV-RNA replicon derived from 
the genotype 2a strain of HCV has been provided for the first time. The replicon- 
replicating cell according to the present invention can be used as a culture system 
for the continuous production of RNA and HCV proteins derived from HCV of 
genotype 2a. Furthermore, the replicon-replicating cell according to the present 
invention is useful as a test system for screening for various substances that affect 
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HCV replication and/or the translation of HCV proteins. 

[Best Mode for Carrying out Invention] 

[0013] 

The present invention is explained in detail as follows. 

[0014] 

1. HCV-derived replicon RNA according to the present invention 

The genome of hepatitis C virus (HCV) is a single-stranded (+) strand 
RNA comprising approximately 9600 nucleotides. This genomic RNA comprises 
the 5' untranslated region (also denoted as 5' NTR or 5' UTR), a translated region 
composed of a structural region and a non-structural region and the 3' untranslated 
region (also denoted as 3' NTR or 3' UTR). HCV structural proteins are encoded 
in the structural region, and a plurality of non-structural proteins are encoded in 
the non-structural region. 
[0015] 

Such HCV structural proteins and non-structural proteins are generated 
through the translation into a continuous form thereof, a polyprotein, from the 
translated region, restricted degradation of the polyprotein by protease, and then 
the release of the structural proteins (Core, EI and E2) and non-structural proteins 
(NS2, NS3, NS4A, NS4B, NS5A and NS5B), respectively. Among these 
structural proteins and non-structural proteins, that is, viral proteins of HCV, Core 
is a core protein, El and E2 are envelope proteins, and non-structural proteins 
(NS2, NS3, NS4A, NS4B, NS5A and NS5B) are proteins involved in virus's own 
replication. NS2 is known to have metalloprotease activity, and NS3 is known to 
have serine protease activity (at one-third of the N terminal side) and helicase 
activity (at two-thirds of the C-terminal side). Furthermore, NS4A is a cofactor 
for protease activity of NS3, and NS5B has been reported to have RNA-dependent 
RNA polymerase activity. Furthermore, the genome of HCV of genotype 2a has 
already been reported to have a similar gene structure (see Patent Literature 1), 
[0016] 
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We have constructed RNA capable of autonomous replication using such 
HCV genome of genotype 2a. Specifically, the PICV-derived replicon RNA of the 
present invention is an RNA construct, which contains the whole or partial RNA of 
the HCV genome of genotype 2a and is capable of autonomous replication. 
[0017] 

In this specification, RNA that is prepared by altering the viral genome of 
HCV and is capable of autonomous replication is referred to as "replicon RNA" or 
"RNA replicon." RNA that is artificially prepared from HCV of genotype 2a and 
is capable of autonomous replication is referred to as "replicon RNA derived from 
HCV of genotype 2a." In this specification, the HCV-derived replicon RNA is 
also referred to as an HCV- RNA replicon. 
[0018] 

In the present invention, "hepatitis C virus of genotype 2a" or "HCV of 
genotype 2a" means hepatitis C virus identified as genotype 2a according to the 
international classification of Simmonds et al. The "hepatitis C virus of genotype 
2a" or the "HCV of genotype 2a" of the present invention encompasses not only a 
virus having naturally occurring HCV genomic RNA, but also a virus having 
genomic RNA prepared by artificially altering a naturally occurring HCV genomic 
sequence. Specific examples of HCV of genotype 2a include viruses of JFH-1 
strain and the JCH-1 strain (see Patent Literature 1). 
[0019] 

Furthermore, "the genomic RNA of hepatitis C virus of genotype 2a" 
means RNA that comprises the single-stranded (+) strand sense RNA of hepatitis C 
virus of genotype 2a and has the nucleotide sequence throughout the entire region 
of its genome. The genomic RNA of hepatitis C virus of genotype 2a is 
preferably RNA comprising the nucleotide sequence represented by SEQ ID NO: 3 
or 5, but is not limited thereto. 
[0020] 

In the specification of the present application, "5' untranslated region" 
14 



JP Application No. 2003-329115 



(5'NTR or 5'UTR), "a sequence encoding NS3 protein, NS4A protein, NS4B 
protein, NS5A protein and NS5B protein," "a sequence encoding Core protein" 
(Core region or C region), "a sequence encoding El protein" (El region), "a 
sequence encoding E2 protein" (E2 region), "a sequence encoding N2 protein" 
(NS2 region), "a sequence encoding NS3 protein" (NS3 region), "a sequence 
encoding NS4A protein" (NS4A region), "a sequence encoding NS4B protein" 
(NS4B region), "a sequence encoding NS5A protein" (NS5A region), "a sequence 
encoding NS5B protein" (NS5B region) and "3" untranslated region" (3' NTR or 3' 
UTR), and other specific regions or sites are determined based on the nucleotide 
sequence of SEQ ID NO: 3 of the full-length cDNA (JFH-1 clone) encoding the 
entire region of the genome of the JFH-1 strain, which is HCV of genotype 2a. 
The nucleotide sequence of SEQ ID NO: 3 can be obtained from the International 
DNA Data Bank (DDBJ/EMBL/GenBank) by referring to the accession No. 
AB047639. Specifically, when a particular HCV RNA sequence is aligned with 
the nucleotide sequence represented by SEQ ID NO: 3, a sequence to be aligned 
with nucleotides 1 to 340 on the nucleotide sequence represented by SEQ ID NO: 
3 is "5' untranslated region" of the RNA, a sequence to be aligned with the 
nucleotides 3431 to 9442 on the same are a sequence encoding NS3 protein, NS4A 
protein, NS4B protein, NS5A protein and NS5B protein, a sequence to be aligned 
with the nucleotides 3431 to 5323 on the same is "a sequence encoding NS3 
protein," a sequence to be aligned with the nucleotides 5324 to 5485 on the same 
is "a sequence encoding NS4A protein," a sequence to be aligned with the 
nucleotides 5486 to 6268 on the same is a sequence encoding NS4B protein," a 
sequence to be aligned with the nucleotides 6269 to 7666 on the same is "a 
sequence encoding NS5A protein," a sequence to be aligned with the nucleotides 
7667 to 9442 on the same is "a sequence encoding NS5B protein," and a sequence 
to be aligned with the nucleotides 9443 to 9678 on the same is "3' untranslated 
region." Furthermore, in this case, gaps, additions, deletions, substitutions or the 
like may be present in the "aligned" sequences. Furthermore, the above 
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"particular HCV" is not limited thereto, and includes the JFH-1 strain or JCH-1 

strain, or viral strains that are derivatives thereof. 

[0021] 

One embodiment of the HCV RNA-replicon according to the present 
invention is a replicon RNA comprising a nucleotide sequence containing at least 
the 5' untranslated region, a sequence encoding NS3 protein, NS4A protein, NS4B 
protein, NS5A protein and NS5B protein, and the 3' untranslated region on the 
genomic RNA of hepatitis C virus of genotype 2a. The replicon RNA may further 
contain at least one selection marker gene or one reporter gene, and at least one 
IRES sequence. Furthermore, this replicon RNA may also contain a sequence 
encoding a viral protein other than NS3, NS4A, NS4B, NS5A and NS5B proteins 
on the genomic RNA of hepatitis C virus of genotype 2a. 
[0022] 

Another preferred embodiment of HCV RNA-replicon according to the 
present invention is a replicon RNA comprising a nucleotide sequence containing 
the 5' untranslated region comprising the nucleotide sequence represented by SEQ 
ID NO: 9 or 10, at least one selection marker gene or reporter gene, the IRES 
sequence, a sequence encoding NS3 protein, NS4A protein, NS4B protein, NS5A 
protein and NS5B protein on the genomic RNA of hepatitis C virus of genotype 
2a, and the 3' untranslated region comprising the nucleotide sequence represented 
by SEQ ID NO: 11 or 12, In this case the nucleotide sequences represented by 
SEQ ID NO: 9 and 10 are sequences of the 5' untranslated regions of rSGREP- 
JFH1 (SEQ ID NO: 1) and rSGREP-JCHl (SEQ ID NO: 2), respectively, which are 
replicon RNAs according to the present invention. Furthermore, the nucleotide 
sequences represented by SEQ ID NO: 11 and 12 are sequences of the 3' 
untranslated regions of rSGREP-JFHl (SEQ ID NO: 1) and rSGREP-JCHl (SEQ 
ID NO: 2), respectively, which are replicon RNAs according to the present 
invention, 
[0023] 
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A more preferred embodiment of HCY RNA-replicon according to the 
present invention is a replicon RNA comprised of an RNA comprising the 
nucleotide sequence represented by SEQ ID NO: 1 or 2. Furthermore, a replicon 
RNA comprising a nucleotide sequence derived from the nucleotide sequence 
represented by SEQ ID NO: 1 or 2 by deletion, substitution or addition of I to 50, 
1 to 30, 1 to 10, I to 6, or 1 to several (2 to 5) nucleotides, and being capable of 
autonomous replication is also included in the scope of the present invention as a 
preferred embodiment. In the present invention, "capable of autonomous 
replication" means that when replicon RNA is introduced into a cell, the replicon 
RNA allows its own full-length sequence to be replicated within the cell. For 
example, this ability of autonomous replication can be confirmed by transfecting 
replicon RNA into Huh7 cells, culturing the Huh7 cells, extracting RNA from the 
cells in the thus resulting culture product and conducting Northern blot 
hybridization for the extracted RNA using a probe that can specifically detect the 
transfected replicon RNA so as to detect the presence of the replicon RNA. 
However, examples of such a method are not limited thereto. Specific procedures 
for confirming the ability of autonomous replication can be conducted according 
to descriptions given in the Examples of this specification such as those for 
measuring the ability of colony formation, those for confirming the expression of 
HCV proteins or those for detecting replicon RNA. 
[0024] 

In the present invention, a "selection marker gene" means a gene that can 
provide a cell with selectivity such that only the cell expressing the gene is 
selected. A general example of a selection marker gene is an antibiotic resistance 
gene. In the present invention, preferred examples of a selection marker gene 
include a neomycin resistance gene, a thymidine kinase gene, a kanamycin 
resistance gene, a pyrithiamine resistance gene, an adenylyl transferase gene, a 
Zeocin resistance gene and a puromycin resistance gene, The neomycin 
resistance gene and the thymidine kinase gene are preferred, and the neomycin 
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resistance gene is more preferred. However, the selection marker gene in the 

present invention is not limited to these genes. 

[0025] 

Furthermore in the present invention, a "reporter gene" means a marker 
gene encoding a gene product that is a marker for the expression of the gene. 
General examples of a reporter gene include structural genes of enzymes that 
catalyze light emitting reaction or color reaction. Preferred examples of the 
reporter gene in the present invention include a transposon Tn9-derived 
chloramphenicol acetyltransferase gene, an Escherichia coli-derived p 
glucuronidase or p galactosidase gene, a luciferase gene, a green fluorescence 
protein gene, an aequorin gene from jellyfish, and a secreted placental alkaline 
phosphatase (SEAP) gene. However, the reporter gene in the present invention is 
not limited to these genes. 
[0026] 

Either only one or both of the above selection marker gene and reporter 
gene may be contained in replicon RNA. 
[0027] 

In the present invention, "IRES sequence" means an internal ribosome 
entry site that allows translation to be initiated by binding ribosomes within the 
inside of RNA. Preferred examples of IRES sequence in the present invention 
include, but are not limited to, EMCV IRES (the internal ribosome entry site of 
encephalomyocarditis virus), FMDV IRES and HCV IRES. EMCV IRES and 
HCV IRES are more preferred, and EMCV IRES is the most preferred sequence. 
[0028] 

The replicon RNA according to the present invention may further contain a 
sequence on the genomic RNA of another HCV strain or HCV of another genotype. 
For example, the replicon RNA may also contain a fragment of HCV genome of 
genotype lb. Examples of another HCV strain include, but are not limited to, 
HCV-1, HCV-H, HC-J1, HCT-18, H77, DK-7, US II, SI 4, HCT23, HCV-Th, DR1, 
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DR4, HCT27, S18, SW1, DK9, H90, TD-6EI, S9, HCV-BK, T10, DK1, HC-J4, 
HCV-J, HK3, HK8, HK5, HCV-G3, IND5, IND8, P10, Dl, D3, SW2, T3, S45, 
SA10, US 6, HCV-JK1, HCV-JK4, HCV-JK3, HCV-JK2, HCV-JT, HC-J2, HCV-T, 
HK4, HC-G9, Zl, Bi, S. I., Cho, J.M., HCV-J6, T4, T9, US 10, HC-J5, T2, HC-J7, 
DK11, SW3, DK8, T8, HC-J8, S83, HK2, HC-J6, HC-J8, BEBE1 , HCV-J6, HCV- 
J8, HD10-2, BR36-9, S52, S54, S2, BR33-1, HK10, DK12, HCV-TR, BA-1, BA-2, 
DK13, Zl, Z4, Z6 5 Z7, HK2, SAl, SA4, SA5, SA7, SA13, SA6, NZL1, SA30, EG- 
13, HCV-K3a/650, ED43, EUH1480, EUHK2, T11580, VN235, VN405, VN004, 
JK049, JK046, JFH-1, JCH-1, JCH-2, JCH-3, JCH-4, JCH-5, JCH-6, J6CF and 
H77. 
[0029] 

The replicon RNA according to the present invention preferably has the 5' 
untranslated region on the genomic RNA of HCV of genotype 2a on the 5'-most 
side, and the 3 'untranslated region on the genomic RNA of HCV of genotype 2a 
on the 3 '-most side. A selection marker gene or a reporter gene may be ligated 
upstream of the IRES sequence, or upstream or downstream of "the sequence 
encoding NS3 protein, NS4A protein, NS4B protein, NS5A protein and NS5B 
protein," or inserted in the middle of "the sequence encoding NS3 protein, NS4A 
protein, NS4B protein, NS5A protein and NS5B protein." 
[0030] 

The replicon RNA according to the present invention more preferably has 
the 5' untranslated region on the genomic RNA of HCV of genotype 2a on the 5'- 
most side, and a selection marker gene or a reporter gene, the IRES sequence and 
"the sequence encoding NS3 protein, NS4A protein, NS4B protein, NS5A protein 
and NS5B protein" downstream of the 5' untranslated region in this order, and the 
3' untranslated region on the genomic RNA of HCV of genotype 2a on the 3'-most 
side. 
[0031] 

Examples of the replicon RNA according to the present invention may 
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include an RNA containing any foreign gene to be expressed within a cell into 
which the replicon RNA is introduced, in addition to the sequences as described 
above. A foreign gene may also be ligated downstream of the 5' untranslated 
region, or ligated upstream or downstream of a selection marker gene or a reporter 
gene, or ligated upstream or downstream of "the sequence encoding NS3 protein, 
NS4A protein, NS4B protein, NS5A protein and NS5B protein," or may be inserted 
in the middle of "the sequence encoding NS3 protein, NS4A protein, NS4B protein, 
NS5A protein and NS5B protein." A replicon RNA containing a foreign gene can 
express a protein encoded by the foreign gene when it is translated within a cell 
into which the RNA is introduced. Thus, the replicon RNA containing a foreign 
gene can be appropriately used also for gene therapy or the like, the purpose of 
which is to generate a particular gene product within a cell. 
[0032] 

The replicon RNA according to the present invention may further contain a 
ribozyme. A ribozyme is inserted to Iigate a selection marker gene, a reporter 
gene or a foreign gene on the 5' side in the replicon RNA to those located on the 3' 
side thereof including the IRES sequence and "the sequence encoding NS3 protein, 
NS4A protein, NS4B protein, NS5A protein and NS5B protein," so that it enables 
cleavage and separation of the two by the self-cleavage activity of the ribozyme. 
[0033] 

In the replicon RNA according to the present invention, the above 
described selection marker gene, reporter gene, sequences encoding viral proteins 
on the genomic RNA of hepatitis C virus of genotype 2a, sequences encoding viral 
proteins of HCV of a genotype other than genotype 2a, a foreign gene or the like 
are ligated so that they are translated from the replicon RNA in the correct reading 
frame. Among these sequences, the protein-coding sequences may be ligated to 
each other via a protease cleavage site and the like, so that after the proteins are 
expressed as a fusion protein with the polyprotein that is translated from "the 
sequence encoding NS3 protein, NS4A protein, NS4B protein, NS5A protein and 
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NS5B protein" of hepatitis C virus of genotype 2a, the fusion protein is separated 

by protease into each protein. 

[0034] 

2. Preparation of replicon RNA according to the present invention 

The HCV RNA-replicon according to the present invention can be prepared 
using any genetic engineering techniques known by persons skilled in the art. 
The HCV RNA-replicon can be prepared by, for example, the following method, 
but the method of preparation is not limited thereto. 
[0035] 

First, DNA corresponding to the entire region of the genomic RNA of 
hepatitis C virus of genotype 2a is ligated downstream of an RNA promoter 
according to a standard procedure so as to prepare a DNA clone. As used herein, 
"DNA corresponding to RNA" means a DNA having a nucleotide sequence derived 
from the nucleotide sequence of the RNA by substituting U (uracil) with T 
(thymine). The above RNA promoter is preferably an RNA promoter contained in 
a plasmid clone. An example of an RNA promoter is not limited, but T7 RNA 
promoter is particularly preferred. 
[0036] 

Next, for the thus prepared DNA clone, for example, the structural region 
(Core sequence, El sequence and E2 sequence) located downstream of the 5' 
untranslated region and the sequence encoding NS2 protein are substituted with a 
DNA fragment containing a selection marker gene or a reporter gene and the IRES 
sequence ligated downstream thereof. In this substitution, portions other than the 
structural region, such as a fragment on the 3' terminal side of the 5' untranslated 
region or a part of the sequence encoding NS3 protein may be substituted with a 
sequence derived from HCV of another genotype. 
[0037] 

Subsequently, using the DNA clone after the substitution as a template, 
RNA is synthesized using RNA polymerase, RNA synthesis can be initiated by a 
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standard procedure from the 5' untranslated region and the IRES sequence. When 
a template DNA is a plasmid clone, the above DNA region ligated downstream of 
an RNA promoter is excised by a restriction enzyme from the plasmid clone, and 
then RNA can be synthesized using the DNA fragment as a template. In addition, 
preferably the 3' terminus of RNA to be synthesized agrees with the 3 1 untranslated 
region of the viral genomic RNA, and no other sequences are added or deleted. 
The thus synthesized RNA is the replicon RNA according to the present invention, 
[0038] 

3, Preparation of replicon-replicating cells into which replicon RNA from HCV of 
genotype 2a is introduced 

The replicon RNA that is prepared as described above is introduced into 
cells in which the replicon RNA should be replicated, so that cells wherein the 
replicon RNA is continuously replicated can be obtained. In this specification, a 
cell wherein replicon RNA is continuously replicated is referred to as a "replicon- 
replicating cell." 
[0039] 

As a cell into which replicon RNA is introduced, any cell can be used, as 
long as it can be subcultured. Such a cell is preferably a eukaryotic cell, more 
preferably a human liver-derived cell, a human uterine cervix-derived cell or a 
human fetal kidney-derived cell, and further preferably any cell selected from the 
group consisting of Huh7 cells, HepG2 cells, IMY-N9 cells, HeLa cells and 293 
cells. As these cells, commercially available cells may be utilized, these cells 
may be obtained from cell depositories, or cell lines established from any cells 
(e.g., cancer cells or stem cells) may also be used. 
[0040] 

As the above cells, cells that can be mass-cultured are preferably used for 
the purpose of the mass production of HCV proteins, such as in the case of vaccine 
production, From such a viewpoint, the cells are preferably those other than 
Huh7 cells. 
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[0041] 

Introduction of replicon RNA into cells can be performed using any 
technique known by persons skilled in the art. Examples of such an introduction 
method include electroporation, a particle gun method, a lipofection method, a 
calcium phosphate method, a microinjection method and a DBAE sepharose 
method. The method using electroporation is particularly preferred. 
[0042] 

A replicon RNA of interest may be introduced alone, or may be introduced 
after it is mixed with other nucleic acids. To vary the quantity of replicon RNA 
while keeping RNA quantity to be introduced at a certain level, the replicon RNA 
of interest is mixed with total cellular RNA extracted from cells into which the 
RNA is introduced, and then the mixture is used for introduction into cells. The 
quantity of replicon RNA to be used for introduction into cells may be determined 
depending on the introduction method employed, and is preferably between 1 
picogram and 100 micrograms, and more preferably between 10 picograms and 10 
micrograms. 
[0043] 

When replicon RNA containing a selection marker gene or a reporter gene 
is used for introduction into cells, cells wherein the replicon RNA is introduced 
and continuously replicated can be selected utilizing the expression of the 
selection marker gene or the reporter gene. Specifically, for example, such cells 
into which replicon RNA has been introduced may be cultured in media whereby 
the cells can be selected by the expression of the selection marker gene or the 
reporter gene. As an example, when replicon RNA contains a neomycin 
resistance gene as a selection marker gene, cells into which replicon RNA has 
been intracellularly introduced are seeded into a culture dish. After 16 to 24 
hours of culture, G4I8 (neomycin) is added to the culture dish at a concentration 
of 0,05 milligrams/mi Hi liter to 3.0 milligrams/milliliter. The cells are 
continuously cultured for preferably 10 days to 40 days and more preferably 14 
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days to 28 days after seeding, while exchanging the culture solution twice a week. 
Next, surviving cells are stained with crystal violet, so that cells into which the 
replicon RNA has been introduced and is being continuously replicated can be 
selected as formed colonies. 
[0044] 

Cloned cells can be obtained from the formed colonies by cloning 
surviving cells by a standard procedure, and then continuing the culture of the 
cells. The thus obtained cell clone wherein the replicon RNA of interest is 
continuously replicated is referred to as "a replicon-rep Heating ceil clone" in this 
specificat ion. 
[0045] 

Regarding the established cell clone, detection of a replicon RNA that has 
been replicated from the introduced replicon RNA in the cell clone, confirmation 
of the presence or the absence of the incorporation of a selection marker gene or a 
reporter gene in the introduced replicon RNA into a host genomic DNA, and 
confirmation of the expression of an HCV protein are preferably carried out to 
confirm the fact that a replicon RNA of interest is actually and continuously 
replicated. 
[0046] 

A replicon RNA that has been replicated from the introduced replicon RNA 
in the ceil clone (in this specification, hereinafter conveniently referred to as 
"replicated replicon RNA") may be detected according to any RNA detection 
method known by persons skilled in the art. For example, detection can be 
performed by conducting the Northern hybridization method for total RNA 
extracted from the ceil clone using as a probe a DNA fragment specific to the 
introduced replicon RNA. 
[0047] 

Furthermore, the presence or the absence of the incorporation of a 
selection marker gene or a reporter gene in the introduced replicon RNA into a 
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host genomic DNA can be confirmed by, for example, performing PCR for the host 
genomic DNA extracted from the cell clone to amplify at least a part of the 
selection marker gene or the reporter gene, and then confirming the presence or 
the absence of the amplified product. However, examples of relevant methods 
are not limited thereto. A cell clone for which the amplified product is confirmed 
is considered to have a selection marker gene or a reporter gene incorporated in 
the host genome. Thus, regarding the cell clone, the replicon RNA itself may not 
be continuously replicated within the cell. In this case, whether or not the 
replicon RNA is continuously replicated can be confirmed by conducting an 
experiment to confirm the expression of an HCV protein, as described below. 
[0048] 

The expression of an HCV protein can be confirmed by, for example, 
causing an antibody against an HCV protein to be expressed from the introduced 
replicon RNA and to react with a protein extracted from a cell clone. This 
method can be conducted by any protein detection method known by persons 
skilled in the art. Specifically, for example, a protein sample extracted from the 
cell clone is blotted onto a nitrocellulose membrane, with which an anti-HCV 
protein antibody (e.g., an anti-NS3-specific antibody or an antiserum collected 
from a hepatitis C patient) is reacted, and then the anti-HCV protein antibody is 
detected. If the HCV protein is detected among proteins extracted from the cell 
clone, it can be concluded that this cell clone continuously replicate HCV-derived 
replicon RNA to express the HCV protein. 
[0049] 

As described above, cell clones confirmed to continuously replicate a 
replicon RNA of interest (replicon-replicating cell clones) can be obtained. 
Furthermore in the present invention, replicon RNA can be obtained by any 
method known by persons skilled in the art, for example, by extracting RNA from 
the replicon-replicating cell, and then separating replicon RNA from the RNA by 
an electrophoresis method. The present invention also relates to such a method 
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of producing replicon RNA. Moreover, preferably, the replicon-replicating cell 
according to the present invention can be used for producing HCV proteins. 
Persons skilled in the art can obtain HCV proteins from the replicon-replicating 
cells according to any standard method. Specifically, for example, a viral protein 
of hepatitis C virus of genotype 2a can be produced by culturing replicon- 
replicating cells, collecting proteins from the resulting culture product (including 
cultured cells and culture media) by a standard procedure, and then selectively 
obtaining viral proteins from the proteins by detection or the like using an anti- 
HCV protein antibody, 
[0050] 

Moreover, when the replicon-replicating cell according to the present 
invention continuously replicates replicon RNA containing a foreign gene, a 
protein encoded by the foreign gene can be obtained by the expression thereof 
using the replicon-replicating cell. Specifically, for example, the protein encoded 
by a foreign gene can be obtained by culturing replicon-replicating cells, 
collecting proteins from the resulting culture product (including cultured cells and 
culture media) by a standard procedure, and then selectively obtaining the protein 
from among the proteins by detection or the like using an antibody against the 
protein of interest. 
[0051] 

4. Introduction of mutation that increases replication efficiency into replicon RNA 
from HCV of genotype 2a 

Mutation producing enhancement of replication efficiency frequently takes 
place in the replicon RNA that is replicated or generated in the replicon- 
replicating cell (replicated replicon RNA) according to the present invention. 
Such a mutation may be an adaptive mutation. 

Utilizing this fact, introduction of a mutation enhancing replication 
efficiency into the replicon RNA according to the present invention can be 
promoted in the present invention. 
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[0052] 

Specifically, the step comprising obtaining a first replicated replicon RNA 
by extraction or the like from a first replicon-replicating cell (preferably, a 
replicon-replicating cell, wherein the replicon RNA according to the present 
invention has been introduced), and then re- introducing the first replicated 
replicon RNA into another cell to prepare a second replicon-replicating cell is 
performed repeatedly once or more, preferably 1 to 10 times, more preferably 1 to 
5 times, and further preferably 1 to 2 times, so that the mutation increasing 
replication efficiency can be introduced at a high frequency into the replicon RNA 
within the replicon-replicating cells. 
[0053] 

As a cell into which a replicated replicon RNA is re-introduced, any cell 
can be used. Such a cell is preferably derived from a biological species that is 
the same as that of a cell wherein replicon RNA is introduced at the beginning, 
more preferably derived from the same tissue derived from the same biological 
species as that of a cell wherein replicon RNA is introduced at the beginning, and 
further preferably of a cell line that is the same as that for a cell wherein replicon 
RNA is introduced at the beginning. 
[0054] 

Therefore in the present invention, using the above method, replicon RNA 
wherein the mutation increasing replication efficiency is introduced can be 
produced. Specifically, the step comprising obtaining a first replicated replicon 
RNA by extraction or the like from a first replicon-replicating cell (preferably, a 
replicon-replicating cell, into which the replicon RNA according to the present 
invention has been introduced), and then re-introducing the first replicated 
replicon RNA into another cell so as to prepare a second replicon-replicating cell 
is performed repeatedly once or more, preferably 1 to 10 times, more preferably 1 
to 5 times, and further preferably 1 to 2 times. Subsequently, the replicated 
replicon RNA is obtained by extraction or the like from the replicon-replicating 
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cell finally obtained at the end of the repeated steps, so that replicon RNA with 

increased replication efficiency can be produced. 

[0055] 

In the present invention, the replication efficiency of a replicon RNA can 
be increased at least 2 times, preferably 10 to 100 times, and more preferably 100 
to 10000 times by the above method. 
[0056] 

Regarding the replicon RNA that is produced by such a method so as to 
have increased replication efficiency, the nucleotide sequence is preferably 
determined by a known method, for example, by obtaining cDNA by reverse 
transcription PCR and subjecting such cDNA to sequencing. Furthermore, the 
thus determined nucleotide sequence or the amino acid sequence encoded by the 
nucleotide sequence is compared with the nucleotide sequence of replicon RNA 
that had been introduced at the beginning into cells, so that adaptive mutation can 
be identified. As adaptive mutation increasing replication efficiency, in 
particular, nonsynonymous substitution that mutates an amino acid in a viral 
protein encoded by replicon RNA is preferred. 
[0057] 

The present invention also provides a method whereby the replicon RNA 
of hepatitis C virus of genotype 2a having increased replication efficiency can be 
produced by introducing the thus identified adaptive mutation into replicon RNA, 
the replication efficiency of which is to be increased, by a standard procedure. 
[0058] 

The replicon RNA that is produced as described above so as to have 
increased replication efficiency can be used for producing replicon RNA in large 
quantity within cells that have been used for the method. 
[0059] 

The replication efficiency of the replicon RNA according to the present 
invention can be determined by a method known by persons skilled in the art. 
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For example, it can be determined according to the following method. Replicon 
RNAs are transfected in quantities of 0.0001, 0.0003, 0.001, 0.003, 0.01, 0.03, 0.1, 
0.3 and 1.0 micrograms, respectively, into Huh7 cells, selective culture with G418 
is performed for 21 days in a method similar to the above experimental techniques, 
and then the number of colonies formed (number of colonies) is counted. The 
quantity of RNA introduced is compared with the number of colonies formed to 
determine the range of the quantity of the replicon RNA introduced, within which 
colony formation increases in a quantity-dependent manner. The number of 
colonies formed within the range is divided by the quantity of RNA introduced, 
and the resulting value is regarded as the colony forming activity per microgram. 
This equation is as follows. 

Colony forming activity [(Colony Forming Unit, or CFU)/microgram] = 
Number of colonies formed [colony] / quantity of RNA introduced [microgram] 
[0060] 

The thus calculated colony forming activity is regarded as a value 
representing the replication efficiency of replicon RNA introduced. Specifically, 
the higher the colony forming activity, the higher the replication efficiency of the 
replicon RNA, In addition, the replication efficiency of replicon RNA can also 
be shown via a colony- forming ability that is represented by the number of copies 
of the replicon RNA introduced per formed colony. That is, in this case, the 
ability can be calculated according to the following equation. 

Colony forming ability = number of copies of replicon RNA introduced 
[copy] / number of formed colonies [colony] 
[0061] 

5. Other embodiments of the present invention 

The replicon RNA-replicating cell according to the present invention can 
also be used as a test system for, for example, screening for a substance that 
promotes or suppresses the replication of hepatitis C virus. Specifically, for 
example, replicon replicating cells are cultured in the presence of a test substance, 
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replication of the rep 1 icon RNA in the resulting culture product is detected, and 
then whether or not the test substance promotes or suppresses the replication of 
the repiicon RNA is determined, so that a substance that promotes or suppresses 
the replication of hepatitis C virus can be screened for. In this case, detection of 
the replication of the repiicon RNA in the resulting culture product may be 
conducted by detecting the quantity of, or the presence or the absence of, the 
repiicon RNA in the RNAs extracted from the repiicon RNA-replicating cell, or by 
detecting the quantity of, or the presence or the absence of, HCV protein contained 
in the proteins in the culture product or in the repiicon RNA-replicating cells 
contained in the culture product. 
[0062] 

Such a test cell system using the repiicon RNA-replicating cells according 
to the present invention may be aimed at producing or evaluating a therapeutic 
agent or a diagnostic agent for treating hepatitis C virus infection. Specific 
examples of such purposes include the following examples. 
[0063] 

(1) Search for a substance suppressing the proliferation of HCV of genotype 2a 

Examples of a substance suppressing the proliferation of HCV of genotype 
2a include organic chemicals directly or indirectly affecting the proliferation of 
HCV of genotype 2a, and antisense oligonucleotides directly or indirectly 
affecting the proliferation of HCV or the translation of HCV proteins by 
hybridizing to a target sequence in the HCV genome of genotype 2a or a 
complementary strand thereof. 

(2) Evaluation of various substances having antiviral action in cell culture 

Examples of the various substances include substances obtained through 
rational drug design or high throughput screening (e.g., an isolated and purified 
enzyme) and the like. 

(3) Identification of a new target for attack for treating patients infected with HCV 
of genotype 2a 
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To identify a host cellular protein that plays an important role in 
proliferation of HCV virus, for example, the repUcon-repltcating cell according to 
the present invention can be used. 

(4) Evaluation of the ability of HCV virus to acquire resistance against a drug or 
the like and identification of mutation concerning such resistance 

(5) Production of a viral protein as an antigen that can be used for developing, 
producing and evaluating a diagnostic agent or a therapeutic agent for hepatitis C 
virus infection 

(6) Viral genome replication system for producing HCV virus or virus-like 
particles that can be used for developing, producing and evaluating a diagnostic 
agent or a therapeutic agent for hepatitis C virus infection 

(7) Production of a vaccine antigen that can be used as a vaccine against HCV of 
genotype 2a 

(8) Production of hepatic cell-directed genetic vector that is used after the 
incorporation of a foreign gene therein for gene therapy 

[Examples] 
[0064] 

The present invention will be described more specifically based on the 
following examples and drawings. However, the technical scope of the present 
invention is not limited by these examples. 
[0065] 

[Example 1] Preparation of replicon RNA 
(A) Construction of expression vector 

DNA corresponding to the entire region of viral genome of hepatitis C 
virus JFH-1 strain (genotype 2a) that had been separated from patients with 
fulminant hepatic failure was obtained from a JFH-1 clone containing the full- 
length genomic cDNA of the virus strain. The DNA was inserted downstream of 
T7 RNA promoter sequence that had been inserted in pUC19 plasmid, The thus 
constructed plasmid DNA is hereinafter referred to as pJFHl. Similarly, DNA 
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corresponding to the entire region of viral genome of hepatitis C virus JCH-1 
strain (genotype 2a) that had been separated from patients with chronic hepatitis 
was obtained from a JCH-1 clone containing the full-length genomic cDNA of the 
virus strain. The DNA was inserted downstream of the T7 RNA promoter 
sequence that had been inserted in pUC19 plasmid. The thus constructed plasmid 
DNA is hereinafter referred to as pJCHl. In addition, the preparation of the 
above JFH1 clone and JCH-1 clone is described in Patent Literature 1 and Non 
Patent Literature 3. Moreover, the nucleotide sequence of the full-length cDNA 
of JFH-1 clone was registered at the International DNA Data Bank 
(DDBJ/EMBL/GenBank) under accession No. AB047639, and the nucleotide 
sequence of the full-length cDNA of the JCH-1 clone under accession No. 
AB047640, 
[0066] 

The structures of the thus constructed plasmid DNA pJFHl and pJCHl are 
shown in the upper section of Fig. I. "T7" represents T7 RNA promoter, and "G" 
represents dGTP inserted upstream of the 5* end of the inserted JFH-1- or JCH-1- 
derived DNA and downstream of the 3' end of T7 RNA promoter sequence. A 
region from "5' NTR" to "3 1 NTR" is DNA corresponding to the entire genomic 
region of hepatitis C virus. 
[0067] 

Next, the structural regions and a part of the non-structural regions of 
plasmid DNA pJFHl and pJCHl were substituted with a neomycin resistance gene 
(neo; also referred to as a neomycin phosphotransferase gene) and EMCV-IRES 
(internal ribosome entry site of encephalomyocarditis virus), thereby constructing 
plasmid DNA pSGREP-JFHl and pSGREP-JCHl, respectively (lower section of 
Fig, 1), This construction procedure was conducted according to a previous 
report (Non Patent Literature 7). Specifically, plasmid pJFHl and pJCHl were 
cleaved with restriction enzymes Age I and Cla I, and between the Age I and Cla I 
restriction sites, the following fragments were inserted to be ligated; a fragment 
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was prepared by binding of a sequence ranging from 5' NTR to Core region 
derived from pJFH-1 with the neomycin resistance gene derived from pRSV5NEO 
by PCR amplification and then cleaving it with restriction enzymes Age I and Pme 
I, and, a fragment was prepared by binding of sequences ranging from EMCV 
IRES to NS3 region by PCR amplification and then cleaving it with restriction 
enzymes Pme I and Cia I. 
[0068] 

Moreover, a mutation that mutates an amino acid motif GDD to GND, 
corresponding to the active center of RNA polymerase encoded by the NS5B 
region, was introduced into the NS5B region in pSGREP-JFHl, thereby preparing 
a mutant plasmid clone pSGREP-JFHl/GND. 
[0069] 

Moreover, a mutation that results in the deletion of a sequence of 10 
continuous amino acids containing an amino acid motif GDD corresponding to the 
active center of RNA polymerase encoded by the NS5B region was introduced into 
the NS5B region in pSGREP-JFHl, thereby preparing a mutant plasmid clone 
pSGREP-JFHl/dGDD. 
[0070] 

The above-prepared mutant clones pSGREP-JFHl/GND and pSGREP- 
JFHl/dGDD cannot express active NS5B protein, which is required for the 
replication of replicon RNA, because the amino acid sequence of the active site of 
NS5B protein encoded by these clones has mutated. 
[0071] 

(B) Preparation of replicon RNA 

To prepare template DNA for use in synthesis of replicon RNA, the above- 
constructed expression vectors pSGREP-JFHl, pSGREP-JCHl, pSGREP- 
JFH1/GND and pSGREP-JFHl/dGDD were each cleaved with a restriction enzyme 
Xba I. 
[0072] 
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Subsequently, 10 to 20 u,g each of these Xba I-cleaved fragments was 
contained in 50 p.1 of a reaction solution, and then further treated by 30 minutes of 
incubation at 30°C with 20 U of Mung Bean Nuclease. Mung Bean Nuclease is 
an enzyme catalyzing a reaction for selectively degrading a single-stranded 
portion of double-stranded DNA, Generally, when RNA synthesis is performed 
using directly the above Xba I-cleaved fragment as a template, a replicon RNA 
having four nucleotides of CUGA, a part of the recognition sequence of Xba I, 
excessively added to the 3' terminus would be synthesized. Hence, in this 
example, Xba I-cleaved fragments were treated with Mung Bean Nuclease, so as to 
remove the four nucleotides of CUGA from the fragments. The solutions 
containing Xba I-cleaved fragments, which had been treated with Mung Bean 
Nuclease, were treated to remove proteins according to a general method, so that 
Xba I-cleaved fragments, from which the four nucleotides of CUGA had been 
removed, were purified and used as template DNAs. 
[0073] 

Next, from the template DNA, RNA was synthesized in vitro using T7 
RNA polymerase. For this RNA synthesis, MEGAscript from Ambion, Inc. was 
used. Reaction was carried out using 20 \x\ of a reaction solution containing 0.5 
to 1.0 micrograms of the template DNA according to the instructions of the 
manufacturer. 
[0074] 

After completion of RNA synthesis, DNase (2 U) was added to the reaction 
solution to conduct reaction at 37°C for 15 minutes. RNA extraction using acidic 
phenol was further performed to remove the template DNA. RNAs (replicon 
RNAs) synthesized in this manner from the above template DNAs derived from 
pSGREP-JFHl, pSGREP-JCHl, pSGREP-JFHl/GND and pSGREP-JFHl/dGDD 
were respectively named rSGREP-JFHl, rSGREP-JCHl, rSGREP-JFHl/GND and 
rSGREP-JFHl/dGDD. Regarding the nucleotide sequences of these replicon 
RNAs, the nucleotide sequence of rSGREP-JFHl is shown in SEQ ID NO: 1 and 
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Fig. 2, that of rSGREP-JCHl is shown in SEQ ID NO: 2 and Fig. 3, that of 
rSGREP-JFHl/GND is shown in SEQ ID NO: 7, and that of rSGREP-JFHl/dGDD 
is shown in SEQ ID NO: 8. 
[0075] 

[Example 2] Establishment of replicon-replicating cell clone 

(C) Transfection of replicon RNA, determination of colony-forming ability of 
transfected cells and establishment of cell clones 

Each of the above-synthesized replicon RNAs (rSGREP-JFHl, rSGREP- 
JCH1, rSGREP- JFH 1 /GND and rSGREP-JFHl/dGDD) was mixed in different 
quantities with total cellular RNA extracted from Huh7 cells so as to have a total 
RNA quantity of 10 jig. Subsequently, the mixed RNA was introduced into Huh7 
cells by the electro poration method. The Huh7 cells subjected to the 
electroporation treatment were seeded into culture dishes, and then cultured for 16 
hours to 24 hours. G418 (neomycin) was then added to the culture dishes at 
different concentrations. Thereafter, culture was continued while exchanging the 
culture solutions twice a week. After 21 days of culture following seeding, 
surviving cells were stained with crystal violet. The number of stained colonies 
was counted, and then the number of colonies obtained per jag of the transfected 
replicon RNA was calculated. 
[0076] 

For tSGREP-JFHI or rSGREP-JCHI -transfected cells, for which colony 
formation had been observed, colonies of the surviving cells were further cloned 
from the above culture dishes after 21 days of culture, and were continuously 
cultured. By such cloning of colonies, several strains of cell clones could be 
established. 
[0077] 

For the established cell clones, detection of the replicated replicon RNA, 
confirmation of the presence or the absence of the incorporation of the neomycin 
resistance gene into the host genomic DNA, and confirmation of the expression of 
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HCV proteins were performed as described later, in Example 4. Cell clones for 
which the replication of the replicon had been confirmed in the cells were 
regarded as replicon-replicating cell clones. 
[0078] 

(D) Colony-forming ability in each transfected cell 

As a result of the above transfection, for rSGREP-JFHl -transfected Huh7 
cells, the colony- forming ability per p.g of the transfected replicon RNA was 
94700 CPU (Colony Forming Unit)/u.g-RNA when G418 concentration was 1.0 
mg/ml (the left column in Fig. 4). In contrast, colony formation was not 
observed in the Huh7 cells, into which rSGREP-JFHl/dGDD and rSGREP- 
JFH1/GND had each been transfected (the central column and the right column in 
Fig. 4). This suggests that the colony-forming ability confirmed for the Huh7 
cells, into which rSGREP-JFHl replicon RNA had been transfected, depends on 
the activity of NS5B (RNA polymerase) expressed by rSGREP-JFHl. 
Specifically, it was considered that in cells that had formed colonies, rSGREP- 
JFH1 replicon RNA autonomously replicated due to the action of NS5B expressed 
by rSGREP-JFHl, and the neomycin resistance gene was continuously expressed 
to maintain G418 resistance, so that cell growth was enabled, 
[0079] 

On the other hand, in the Huh7 cells, into which rSGREP-JCHl had been 
transfected, no colony formation was observed in the case of 1 to 0.5 mg/ml G418 
concentrations (Fig. 5). When G418 concentration was lowered to 0,25 mg/ml, 
colony formation was observed in the Huh7 cells, into which rSGREP-JCHl had 
been transfected as well. 
[0080] 

Furthermore, Xba I-cleaved fragment of the expression vector pSGREP- 
JFH1 obtained in (B) above was used as a template DNA for RNA synthesis 
without treating the fragment with Mung Bean Nuclease, so as to synthesize 
replicon RNA. This replicon RNA was transfected to Huh7 cells in a manner 
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similar to that in (C) above. The repltcon RNA that had been prepared without 
performing Mung Bean Nuclease treatment had the four nucleotides of CUGA 
excessively added to the 3' terminus. 
[0081] 

As a result, the colony- forming ability of the Huh7 cells, into which the 
repHcon RNA prepared without treatment with Mung Bean Nuclease had been 
transfected, decreased to 512 CFU/ug-RNA (the left side in Fig. 6). This result 
revealed that the sequence on the 3' terminus of the replicon RNA affects the 
colony-forming ability of the transfected cells. 
[0082] 
[Example 3] 

(E) Re-transfection of replicated replicon RNA derived from replicon-replicating 
cells 

From the replicon-replicating ceil clones that had been established by 
transfection of rSGREP-JFHl into Huh7 cells according to descriptions of 
Example 2, total RNA was extracted by a standard procedure. The number of 
copies of the replicated replicon RNA contained in the cellular RNA was 
determined by Northern blot analysis and a quantitative RT-PCR method. 
[0083] 

Northern blot analysis was performed according to the description in 
Molecular Cloning, A laboratory Manual, 2 nd edition, J. Sambrook, E. F, Fritsch, T. 
Maniatis, Cold Spring Harbor Laboratory Press (1989). Specifically, RNA 
extracted from the cells was subjected to denaturing agarose electrophoresis. 
After electrophoresis, the RNA was transferred onto a positively charged nylon 
membrane. The 32 P-labeled DNA or RNA probe prepared from pSGREP-JFHI 
was hybridized to the RNA transferred to the membrane as described above. 
Next the membrane was washed, and then exposed to a film, so as to detect a 
replicon-specific RNA band. 
[0084] 
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Detection of the replicon RNA by quantitative RT-PCR was conducted by 
detecting the 5' untranslated region RNA within HCV RNA according to Takeuchi 
T, Katsume A, Tanaka T, Abe A, Inoue K, Tsukiyama-Kohara K, Kawaguchi R, 
Tanaka S and Kohara M., Real-time detection system for quantification of 
Hepatitis C virus genome, Gastroenterology 116: 636-642 (1999). Specifically, 
the replicon RNA contained in RNA extracted from the cells was amplified by 
PCR using synthetic primers: R6-130-S17, 5'-CGGGAGAGCCATAGTGG-3' (SEQ 
ID NO: 13) and R6-290-R19, 5'-AGTACCACAAGGCCTTTCG-3' (SEQ ID NO: 
14); TaqMan Probe; R6-148-S21FT, 5'-CTGCGGAACCGGTGAGTACA03' (SEQ 
ID NO: 15) and an EZ rTth RNA PCR kit, and then detected using an ABI Prism 
7700 sequence detector system. 
[0085] 

Next, aliquots of total cellular RNAs extracted from clone 6 (among the 
above-mentioned replicon-replicating cell clones) and pool clones (prepared by 
collecting replicon-replicating cells that had formed colonies from whole one dish 
and culturing them) were each introduced into another Huh7 cells by 
retransfection. Total cellular RNA used for the transfection was prepared to 
contain IxlO 7 copies of replicon RNA based on the number of copies of the above- 
determined replicon RNA. Transfection was performed as described in (C) above, 
and then selective culture was performed under G418 concentration conditions of 
I mg/ml. Thus, the colony formation of the replicon-replicating cells was 
observed (Fig. 7). The colony-forming ability in this case was 1 colony or more 
per lxl 0 6 copies of the replicon RNA used for transfection, when it was calculated 
from the number of colonies obtained. 
[0086] 

On the other hand, the number of copies of in vitro synthetic RNA that had 
been synthesized in vitro using pSGREP-JFHl as a template and T7 RNA 
polymerase was approximately 2x1 0 11 copies/u.g-RNA, when calculated based on 
the weight and the length of the RNA. The colony- forming ability in the case of 
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using the in vitro synthetic RNA for transfection in a manner similar to the above 
method was 1 colony per 5x1 0 7 copies. These results revealed that when RNA 
derived from cells extracted from replicon-replicating cells and in vitro synthetic 
RNA were each transfected to Huh7 cells as rep 1 icon RNA in the same number of 
copies, the use of the replicon RNA replicated within Huh7 cells resulted in 
colony- forming ability approximately 50 times higher than that of the in vitro 
synthetic RNA. 
[0087] 
[Example 4] 

(F) Detection of replicon RNA 

According to (E) above, cell clones [clones Nos. 1 to 11] were established 
by retransfection of total RNA that had been obtained from the replicon- 
replicating cell clone established by transfection of rSGREP-JFHI to Huh7 cells to 
another Huh7 cells. From the established cell clones and pool clones (prepared 
by collecting cell clones that had formed colonies from whole one dish and then 
culturing them), respectively, total RNAs were extracted by an acidic phenol 
extraction method. Subsequently the total RNAs were analyzed by the Northern 
blot method using a pSGREP-JFHl-specific probe as a probe, As control, total 
RNA extracted similarly from untransfected Huh7 cells (in Fig. 8, denoted as 
"Huh7"), a sample prepared by adding 10 7 copies of replicon RNA synthesized in 
vitro to the total RNA extracted from Huh7 cells (in Fig. 8, denoted as "I0 7 "), and 
a sample (in Fig. 8, denoted as "10 8 ") prepared by adding 10 s copies of replicon 
RNA synthesized in vitro to the total RNA extracted from FIuh7 cells, were used. 
In Fig. 8, 1 to 1 1 represent ceil clone Numbers. 
[0088] 

As a result, RNA of approximately the same size as that of rSGREP-JFHI 
was detected using a pSGREP-JFH I -specific probe (Fig. 8). Thus, it was 
confirmed that the replicon RNA from rSGREP-JFHI that had been transfected at 
the beginning replicated and proliferated within the cell clones. In addition, it 
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was shown that the cell clones differed from each other in the quantity of the 
replicated replicon RNA. In Fig. 8, for example, clones 2, 6, 9 and 10 contained 
high quantities of the replicated replicon RNA, and clones 4, 8 and 11 contained 
low quantities of the replicated replicon RNA. 
[0089] 

(G) Confirmation of the presence or the absence of the incorporation of a 
neomycin resistance gene into genomic DNA 

For the cell clones that had been obtained by retransfection of replicon 
RNA as described in Example 3, PCR amplification was performed vising 
neomycin resistance gene-specific primers; sense primer, NEO-S3: 5'- 
AACAAGATGGATTGCACGCA-3' (SEQ ID NO: 16) and antisense primer, NEO- 
R: 5'-CGTCAAGAAGGCGATAGAAG-3' (SEQ ID NO: 17), and the host cellular 
genomic DNA extracted from each of the cell clones as a template, in order to 
confirm that the resistance of each of the cell clones against G418 was not due to 
the incorporation of the neomycin resistance gene into the genome. The cell 
clones used herein were the cell clones Nos. 1 to 8 obtained by retransfection of 
rSGREP-JFHl-derived replicated replicon RNA (rSGREP-JFHl -derived cell 
clones Nos. 1 to 8), and cell clones Nos. 1 to 6 obtained by retransfection of 
rSGREP-JCHl -derived replicated replicon RNA (rSGREP-JCHl-derived cell 
clones Nos. 1 to 6). As a result, as shown in Fig. 9, in the eight examined 
rSGREP-JFHl -derived cell clones, positive clones showing the amplification of 
the neomycin resistance gene were not observed. For rSGREP-JCHl -derived cell 
clones, only 1 out of the 6 examined clones was positive (in Fig. 9, lane 3 in the 
right photograph). It was considered that this positive clone had acquired G418 
resistance by the incorporation of the neomycin resistance gene in rSGREP-JCHl - 
derived replicated replicon RNA into the genomic DNA of the host cells. Thus, 
in the positive clone, unlike other clones, it was thought that the replicon RNA 
itself did not autonomously replicate within the cells. This was confirmed by the 
results of the experiment shown in the next (H) that no HCV proteins were 
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detected from the positive clone. 
[0090] 

(H) Detection of HCV protein 

Protein was extracted from rSGREP-JFHl- and rSGREP-JCHl-transfected 
cell clones by a standard procedure, and then analyzed by SDS-PAGE and Western 
blot method (Fig. 10). The examined cell clones were the same as those used in 
(G) above: rSGREP-JFHl -derived cell clones Nos. 1 to 8 and rSGREP-JCHl- 
derived cell clones Nos. 1 to 6. In addition, a cellular extract from the cell 
obtained by transiently transfecting expression plasmid DNA containing NS3 gene 
into Huh7 cells was regarded as a positive control (NS3 protein). Furthermore, a 
protein extracted from the untransfected Huh7 cells was used as a negative control. 
A protein sample extracted from each cell clone was blotted onto a PVDF 
membrane (Immobilon-P, Millipore), and then detection of NS3 protein encoded 
by replicated replicon RNA was performed using anti-NS3-specific antibody 
(provided by Dr. Moradpour; Wolk B, et al, J. Virology. 2000, 74: 2293-2304). 
As shown in Fig. 10, in rSGREP-JFHl -derived cell clones Nos. 1 to 8 and 
rSGREP-JCHl -derived cell clones Nos. 1, 2 and 4 to 6, proteins of the same size 
as those of the positive control were detected. In rSGREP-JCHl-derived eel! 
clone No. 3 (the clone detected as a positive clone in (G) above), no expression of 
NS3 protein was detected. That is, in rSGREP-JCHl-derived cell clone No. 3, no 
replication of replicon RNA was confirmed. NS3 protein was not detected in the 
untransfected Huh7 cells, revealing that in cell clones wherein NS3 protein was 
detected, the transfected replicon RNA autonomously replicated so that NS3 
protein was expressed. 
[0091] 

Moreover, by the use of the serum of a hepatitis C patient as an antibody, 
the expression of NS5a protein from the replicon RNA was also confirmed in each 
cell clone for which the expression of NS3 protein had been confirmed as 
described above. 
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[0092] 

Based on the results of (G) and (H) above, it was confirmed that replicon 
RNAs were replicated in the cell clones established by transfection of the replicon 
RNA. 
[0093] 
[Example 5] 

(I) Analysis of adaptive mutation 

According to descriptions of Example 3, total RNA obtained from the 
replicon-replicating cell clones established through the transfection of rSGREP- 
JFH1 into Huh7 cells was re-trans fected to another Huh7 cells, thereby 
establishing 21 cell clones. Total RNA was extracted from each of these cell 
clones by a standard procedure. cDNA corresponding to the replicon RNA was 
synthesized using the total RNA as a template, reverse transcriptase Superscript II 
(Invitrogen) and primer 9641R-IH (5'- 

GCACTCTCTGCAGTCATGCGGCTCACGGAC-3' (SEQ ID NO: 18)). The 



composition of a reaction solution for the synthesis 

transcription reaction is as shown below. 

[0094] 

^nrmv^ifi-n ^B^nn Solution Fluid Volume <ui) 


5x 1st strand Buffer 


4 


2 tnM dNTP 


5 


0.1 M DTT 


1 


9651R-IH primer (100 iiM) 


1 


DW (distilled water) 


6.5 


Sample RNA (2 mg/mL) 


1 


RNasin (Pro mega) (40 U/fiL) 


0.5 


Superscript II RT dnvitroeen} 


1 


Total 
[0095] 


20 ul 
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In cDNA synthesis reaction, the above reagents other than RNasin and 
Superscript II were mixed to prepare a first reaction solution. The solution was 
heated at 90°C for 3 minutes, and then cooled on ice. Subsequently, RNasin and 
Superscript II were added to this reaction solution, and then the solution was 
allowed to react at 42°C for 1 hour, followed by another reaction at 70°C for 15 
minutes. 
[0096] 

Furthermore, PCR amplification was performed using the thus obtained 
cDNA together with five primer sets by the following procedures, so that DNA 
amplification fragments covering almost all the regions of the replicon RNA were 
obtained. The primer sets used and regions amplified by each primer set are 
shown in Table 1 and Table 2 below. 
[0097] 
[Table 1] 



Designation of 
amplified fragment 


Primer set 


Amplified region 


Primer 1 


Primer 2 


A/ 


42S-IH 


433R-neo 


41 - 470 


B/ 


C/S17ssp 


4680R-IH 


28 - 3026 


CI 


4534S-IH 


7279R-IH 


2880 - 5625 


D/ 


7198S-IH 


9367R-IH 


5544 - 7713 


E/ 


9247S-NF 


9576R-NF 


7597 - 7960 



In Table 1, an amplified region is represented by nucleotide numbers in 
rSGREP-JFHl (SEQ ID NO: 1) that the region corresponds to. 
[Table 2] 



Primer 
designation 


Nucleotide sequence (5'— »3') 


SEQ ID NO: 


42S-IH 


CCCCTGTGAGGAACTACTGTCTTCACGC 


SEQ ID NO: 19 


C/S17ssp 


CCGGGAGAGCCATAGTGGTCTGCG 


SEQ ID NO: 20 


4534S-IH 


CCACTCAAAGAAAAAGTGTGACGAGCTCGC 


SEQ ID NO: 21 


7198S-IH 


GGCTTGGGCACGGCCTGA 


SEQ ID NO: 22 


9247S-NF 


GCGGTGAAGACCAAGCTCAAACTCACTCCA 


SEQ ID NO: 23 
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433R-neo 


AGAACCTGCGTGCAATCCATC 


SEO ID NO: 24 


4680R-IH 


CCCGTCATGAGGGCGTCGGTGGC 


SEO ID NO: 25 


7279R-IH 


ACCAGCAACGGTGGGCGGTTGGTAATC 


SEO ID NO: 26 


9367R-RI 


GGCACGCGACACGCTGTG 


SEQ ID NO: 27 


9576R-NF 


AGCTAGCCGTGACTAGGGCTAAGATGGAGC 


SEO ID NO: 28 



[0098] 



The composition of a reaction solution in this PCR reaction is as follows. 



Com position »f Ruction Solution Fluid Volume (\x\) 

Primer 1 (10 pM) L0 
Primer 2 (10 pM) L0 
2.5 mM dNTPs 5 -° 
lOx LA Buffer 5 -° 
MgCl 2 (25mM) 5 -° 
LA Taq (TAKARA) (5 U/pl) 0.3 
DW (distilled water) 30 - 7 

Tem plate cDNA 

Total 



50 pi 



[0099] 

In addition, PCR reaction conditions are as follows: 95°C for 2 minutes; 
35 cycles of 98°C for 10 seconds and then 68°C for 8 minutes; and 72°C for 7 
minutes; after which the temperature is kept at 4°C. 
[0100] 

The nucleotide sequence of each PCR product obtained as described above 
was determined, and then the RNA sequence corresponding to the DNA sequence 
was compared with the sequence of rSGREP-JFHl. The results are shown in 
Table 3. 
[0101] 
[Table 3] 

44 



JP Application No. 2003-329115 



Region Synonymous Nonsynonymous Total number of 

substitution substitution mutations 

NS3 0 ~~ 5 5 

NS4A 0 2 2 

NS4B 0 3 3 

NS5A 0 7 ? 

NS5B 3 5_ 8 

Total " 3 22 25 



[0102] 

As shown in Table 3, total number of nucleotide mutations observed in 21 
cell clones was 25. 22 of these mutations were nonsynonymous substitutions 
inducing amino acid mutation. Types of these mutations are as shown in Table 4. 
In addition, the positions of these mutations in the non-structural region are shown 
in Fig. 11. 
[0103] 
[Table 4] 



Clone „ Mutation site . 

designation Nucleotide No. Nucleotide mutation Amino acid mutation Amino acid 



CI 


7098 


A => G 


None 






7157 


A => G 


Y => C 


2824 


C2 


4955 


C => U 


A => V 


2090 


C3 


4936 


A => G 


T A 


2084 




5000 


A => G 


Y => C 


2105 




7287 


A G 


None 






7288 


A => G 


M V 


2868 


C4 


5901 


G => U 


E D 


2405 




6113 


A => U 


H => L 


2476 


C5 


2890 


A => G 


K E 


1402 


C6 


7209 


A => G 


None 
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[0104] 

In Table 4 and Fig. 11, "CI to C6" represent replicon-replicating eel! 
clones CI to C6 having replicon RNA found to have mutations. "Nucleotide No." 
shows the corresponding nucleotide numbers within the nucleotide sequence of 
replicon RNA rSGREP-JFHl (SEQ ID NO: 1). "Amino acid No." shows the 
corresponding amino acid numbers within the amino acid sequence encoded by the 
JFH-1 clone (SEQ ID NO: 4). The types of nucleotides and amino acids at 
mutation sites are described according to their general notations. As shown in 
Table 4, in clone C2, a nucleotide corresponding to nucleotide No. 4955 of SEQ 
ID NO: 1 on the replicon RNA mutated from C (cytosine) to U (uracil), which 
results in a mutation of an amino acid corresponding to amino acid No. 2090 of 
SEQ ID NO: 4 from A (alanine) to V (valine). 
[0105] 

Furthermore, mutation positions shown in Fig. 11 are shown with bar lines 
with the nucleotide numbers shown in Table 4. A thick bar line represents 
nonsynonymous substitution, and a thin bar line represents synonymous 
substitution. 
[0106] 

There were 2 clones having no nucleotide mutations at all that cause amino 
acid mutations. When Northern blot analysis was conducted for the 2 clones, it 
was shown that in these 2 clones, the quantity of replicon RNAs replicated was 
lower than those in the cell clones that had replicated replicon RNAs having a 
nucleotide mutation that causes an amino acid mutation. Hence, it was 
considered that the nucleotide mutation causing an amino acid mutation within the 
replicon RNA was an adaptive mutation for increasing the replication efficiency of 
the replicon RNA in Huh7 cells. 
[0107] 
[Example 6] 

(J) Establishment of replicon-replicating cell clone using cells other than Huh7 
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cells 

According to the method described in Example 1, rSGREP-JFHl was 
transacted into some hepatic cancer cells other than Huh? cells and non-liver- 
derived cells. The transfected cells were seeded into culture dishes and then 
cultured. Colony formation was observed and the number of colonies was 
counted. The cells used for transfection are as follows. 
[0108] 

(!) HepG2 cells (representative hepatic cancer cells as well as Huh7 cells) 

(2) TMY-N9 cells (established by Ito et al; fusion cells of HepG2 cells and human 
primary culture hepatic cells (Hepatology 2001, 34: 566-572)) 

(3) HeLa cells (human cervical cancer-derived cells (Can Cor Res. 1952, 12: 264- 
265)) 

(4) 293 cells (human fetal kidney-derived cells (Gen. Virol. 1977, 36: 59-72)) 
[0109] 

The results of transfection using HepG2 cells, IMY-N9 cells, HeLa cells or 
293 cells, respectively, are shown in Fig. 12a to d. As shown in Fig. 12a to d } all 
HepG2 cells, IMY-N9 cells, HeLa cells, and 293 cells showed colony formation 
for rSGREP- JFH 1 -transfected cells. 
[0110] 

For the established cell clones, detection of the replicated replicon RNA, 
confirmation of the presence or the absence of the incorporation of the neomycin 
resistance gene into host genomic DNA, and confirmation of the expression of 
HCV protein were performed as described later, (L) and (M). The cell clones, for 
which the replication of the replicon in the cells had been confirmed, were 
regarded as replicon-repiicating cell clones. Specifically, it was demonstrated 
that the use of rSGREP-JFHl also enables the preparation of HCV replicon- 
repiicating cells using hepatic cancer cells other than Huh7 cells and non-hepatic 
cells with which the production of HCV replicon-repiicating cells had previously 
been unsuccessful (Blight et al., Science, (2000) 290; 1972-1974). 
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[0111] 

(K) Detection of replicon RNA in replicon-replicating cells using cells other than 
Huh7 cells 

Northern blot analysis was conducted according to a description of 
Molecular Cloning, A laboratory Manual, 2nd edition, J. Sambrook, B. F. Fritsch, 
T. Maniatis, Cold Spring Harbor Laboratory Press (1989). In accordance with the 
descriptions of the previous section (J), total RNA was extracted by the acidic 
phenol extraction method from each of the replicon-replicating cell clones that had 
been established by transfection of rSGREP-JFHl into HepG2, IMY or HeLa cells 
respectively, and from pool clones of the replicon-replicating cells that had been 
established through transfection of rSGREP-JFHl into 239 cells (prepared by 
collecting cell clones that had formed colonies from whole one dish and culturing 
them). Next, the total RNAs were analyzed by the Northern blot method using a 
pSGREP-JFH I -specific probe as a probe. As controls, total RNAs (lanes 1 and 
17 in Fig. 13) extracted similarly from untransfected Hnh7 cells and HepG2 cells, 
and RNA (lanes 2 and 3 in Fig. 13) prepared by adding 10 7 copies or 10 s copies of 
the replicon RNA synthesized in vitro to total RNA extracted from Huh7 cells 
were used. As a result, RNA of approximately the same size of that of rSGREP- 
JFH1 was detected using a pSGREP-JFHl -specific probe (Fig. 13). Accordingly, 
it was confirmed that the replicon RNA derived from rSGREP-JFHl that had been 
transfected at the beginning was replicated and proliferated within the cell clone. 
Furthermore, it was also revealed that the quantities of replicated replicon RNAs 
differed depending on cell type, and IMY cells were found to replicate the replicon 
RNA particularly efficiently. Moreover, it was revealed that the clones differed 
from each other in the quantity of the replicated replicon RNA. 
[0112] 

(L) Confirmation of the presence or the absence of the incorporation of the 
neomycin resistance gene into genomic DNA 

For the thus established replicon RNA-replicating cell clone, PCR 
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amplification was performed using neomycin resistance gene-specific primers 
(sense primer, NEO-S3: 5'-AACAAGATGGATTGCACGCA-3' (SEQ ID NO: 29), 
antisense primer, NEO-R: 5'-CGTCAAGAAGGCGATAGAAG-3' (SEQ ID NO: 
30)) and the host cellular genomic DNA extracted from each of the cell clones as a 
template, in order to confirm that the resistance of each of the cell clones against 
G4I8 was not due to the incorporation of the neomycin resistance gene into the 
genome. The cell clones used herein were the cell clones Nos, 1, 5, 7, 8, 9, 10, 
11, 12 and 13 obtained by retransfection of rSGREP-JFHl -derived replicated 
replicon RNA into HepG2 cells, and the cell clones Nos. 3, 4, 5, 6, 7, 8, 9, 10 and 
11 obtained by retransfection of rSGREP-JFHl-derived replicated replicon RNA 
into IMY N9 cells. As a result, as shown in Fig. 14, in the nine examined cell 
clones obtained by introduction of rSGREP-JFHI into HepG2 cells, a positive 
clone showing the amplification of the neomycin resistance gene was not observed, 
In the 9 examined cell clones obtained by introduction of rSGREP-JFHl into IMY 
N9 cells, a positive clone showing the amplification of the neomycin resistance 
gene was not observed. 
[0113] 

A similar examination was performed for cell clones obtained by 
retransfection of rSGREP-JFHl-derived replicated replicon RNA into HeLa cells, 
and cell clones obtained by retransfection of rSGREP-JFHl-derived replicated 
replicon RNA into 293 cells. Then, a positive clone showing the amplification of 
the neomycin resistance gene was not observed. 
[0114] 

(M) Detection of HCV protein 

Proteins were extracted from the established cell clones by a standard 
procedure, and then analyzed by SDS-PAGE and the Western blot method (Fig. 1 5). 
The cell clones examined in this case were the same as those used in the above 
section: the cell clones Nos, 1, 5, 7, 8, 9, 10, 11, 12 and 13 obtained by 
retransfection of rSGREP-JFHl-derived replicated replicon RNA into HepG2 cells, 
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.no the cel. Cones No, 3, 1, 5, 6, 7, 8, 9, ,0 and .1 obtained by retransfec.ion of 
rS GRE W FH<-derived repUea.ed repHcon RNA into 1MY-N9 eel,s. For.bermore 
oordin g ,o . previoos repor, (Ubrpann e, a,., Se.enee, ( ,999», * - 
re p,,eo„-rep,iea. tag ce„ eione propared b y in.rodoCo g rSGREP-JFHl m.o H„H7 
was re g arded as a positive eon.ro, (Fi, .5, lane 4-,, 06). Moreover a pro.em 
extr ao,ed from tm.ransfec.ed ce„s was used as a „e g a,iv= con.ro, (F , 1 , ^ 

m e m br„„es (Immobilon-P, Miiiipore), and .ben de.ee.ion of NS3 pro.e.n eneo 

by .be rep„e,.ed rep,,co„ RNA was performed os,n g 

( p, ovW ed by O, Moradpoor; Wo,b B, e. a,, , Viroio g y, 7000, 74: 2 

posit ,e eo„,o, was de.ee.ed in .be ee„ Cones Nos. ,, 5, 7, 9, ,0 U, .7 and 
ob«a,ned by re.ransfee.ion of rSOREP.FHi-derived repUea.ed repUoon RNA and 
in .be ee„ e,o»es Nos. 3, 4, 5, 6, 7, 8, 9, ,0 and ,1 ob.ained by re.ransfee,o» of 
rS GREP-JFH> -derived repUea.ed replieon RNA into IMY N9 eells. 

101151 Moreover, by ,be nse of .be sernm of a bepa.i.is C patien, as an an.ibody, 
th e eonfirma.ion of.be expression of NS5a pro,,,, from ,be repiicon RNA was 
performed for eacb cC, e.one .ba. bad been eonf.rmed above ,o express NS3 
pro ,e,n. ,n .bis experimen, examina.ion was performed in a manner Sim „ar o 
1, in .be ease of tbe expression of NS3 protein, bn. „sin g an an.ibody ms.ead of 
the sernm of tbe pa«,e„, As a resn,,, as sbown in .be ,ow see.ion in F, g . ,5, a 
prot e,„ of.be same size as .ba. of.be posbive co,ro, was de.ec.ed ,n be ,« 
e,ones Nos. ,, 5, 7, 8, 9, ,0, 11, .2 and ,3 ob,aioed by retr.nsfecUon 
JF H1 -derived repUea.ed replieon RNA, and .he eel, eiones Nos. 3, 4, 5, 6, 7, 8, , 
10 and „ ob.amed by re.ransfee.ion of rSGREP-JFH 1 -derived repUea.ed repbeon 
RNA into IMY N9 cells. 

101161 W „e„ sim„ar examioa.ion was performed for tbe eel, e,o„es obtained by 
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re » r ansfeo,i„n of rSGREP-JFHl-derived repHoa.ed .epbeon RNA ,n,o HeL eCK 
a „ d ,be oe„ Cones opined b y ,e„ansfeo,,on of .SGREP-Mi-denved 
:;, ioon RNA i,o ,93 eeHs, .be evasion of NS3 a., .ba, of NS 5 a p,„,e.„ S 
could be confirmed. 

[0 ' As neso.ibed above, it was eonfnn,ed that .be .ep.ieon RNA was .epHca.ed 
,„ the „„ Cooes «* had boon es.ab,,sbed ,b,on g b .ranafeo.ion of.be .epbcon 

RNA. 
[0118] 
[Example 7] 

(N) Analysis of adaptive mutation 

Aooo,din g ,o .be daaeHpfioos of E*a»,p,e 3, -a, *NAs ob.a.ne f 0,0 be 
..epUeon-.epboa.ins oei, Cooes es.abHsbed ,b,oo g b «be 

Ml .0.0 Hep 02 and HeLa eeiis were ,e-,ransfee,ed into ano.be, oells of ,b eae 
cell line. ,sp e.ivei, so ,a. ,4 eel, elones we, es.abiisbed for HepO. ee, an 
e e „ J s we ,e e,ab„sbed tor Heba ee„s. P,o m eacb of ,ese ee„ « 
„.a, RNA was exi.ao.ed by a s.anda.d proeed.ne. e D NA eo,.espo„d,n g ,0 .be 

::::„ RNA - - - - • — n: 

tra „ s e,ip,se Snpe.se.ip. n (invb.o.en) and a p»- ^ ' 
GCACTCTCTGCAGTCATGCGGCTCACGGAC-3' (SEQ ID NO: 3,)). 

:::ii.io„ » f . «*. — - *. - 

transcription reaction is as shown below. 
[0119] 



5x 1 st strand Buffer 
2 mM dlNTP 
0.1 MDTT 

9651R-IH primer (100 ^M) 
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DW (distilled water) 6 - 5 
Sample RNA (2 mg/raL) 1 
RNAsin (Promega)(40 U/p.L) 0.5 

Superscript Jl KT flnvitrogen) 1 — 

Total 20 ^ 



[0120] 

In cDNA synthesis reaction, the above reagents other than RNAsin and 
Superscript II were mixed to prepare a first reaction solution. The first reaction 
solution was heated at 90°C for 3 minutes, and then cooled on ice. Subsequently, 
RNAsin and Superscript II were added to the reaction solution, and then the 
solution was allowed to react at 42°C for 1 hour, followed by another reaction at 
70°C for 1 5 minutes. 
[0121] 

Furthermore, PCR amplification was performed using the thus obtained 
cDNA together with five primer sets by the following procedures, so that DNA 
amplification fragments covering almost all the regions of replicon RNA were 
obtained. The primer sets used and regions amplified by each primer set are 
shown in Table 5 and Table 6 below. 
[0122] 
[Table 5] 



Designation of 
amplified fragment 


Primer set 

Primer 2 


Amplified region 


Primer 1 
A 42S-IH 
B C/S17ssp 
C 4534S-IH 
D 7198S-IH 
f 9247S-NF 


433R-neo 
4680R-IH 
7279R-IH 
9367R-IH 
9576R-NF 


41-470 

28-3026 

2280-5625 

5544-7713 

7597-7966 


In this table, an amplified region is represented by 


nucleotide numbers in 
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rSGREP-JFHl (SEQ ID NO: 1) that the region corresponds to. 

[0123] 

[Table 6] 



Primer 

Desi gnation . 

43S-IH 

C/S17ssp 

4534S-IH 

7198S-IH 

9247S-NF 

433R-neo 

4680R-IH 

7279R-IH 

9367R-IH 

Q576R-NF 



Nucleotide Sequence (5' to 3') 



SEQ ID NO: 



CCCCTGTGAGGAACTACTGTCTTCACGC 
CCGGGAGAGCCATAGTGGTCTGCG 
CCACTCAAAG AAAAAGTGTGACGAGCTCGC 

GGCTTGGGCACGGCCTGA 
GCGGTGAAGACCAAGCTCAAACTCACTCCA 

AGAACCTGCGTGCAATCCATC 

CCCGTCATGAGGGCGTCGGTGGC 

ACCAGCAACGGTGGGCGGTTGGTAATC 

GGAACGCGACACGCTGTG 
_ A r,r.TAGCCGTGAr.TAGGGCTA A G ATGGAGC . 



SEQ ID NO: 14 
SEQ ID NO: 15 
SEQ ID NO: 16 
SEQ ID NO: 17 
SEQ ID NO: 18 
SEQ ID NO: 19 
SEQ ID NO: 20 
SEQ ID NO: 21 
SEQ ID NO: 22 
_SF.n ID NO: 23 



[0124] 

The composition of a reaction solution in this PGR reaction is as follows. 

aynpjmtip^^ Fhiid^imeXiill 

Primer 1 (10 pM) 
Primer 2 (10 pM) 
2.5 mM dNTPs 
lOx LA Buffer 
MgCl 2 (25 mM) 
LATaq (TAKARA) (5 U/pl) 
DW (distilled water) 
Tsm plate cDNA — — 



1.0 
1.0 
5,0 
5.0 
5.0 
0.3 
30.7 
_^0_ 



Total 
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[0125] 

In addition, PCR reaction conditions are as follows: 95°C for 2 minutes; 
35 cycles of 98°C for 10 seconds and then 68°C for 8 minutes; followed by 72°C 
for 7 minutes, after which the temperature is kept at 4°C, 
[0126] 

The nucleotide sequence of each PCR product obtained as described above 
was determined, and then the RNA sequence corresponding to the DNA sequence 
was compared with the sequence of rSGREP-JFHl. The results are shown in 
Table 7 and Table 8. 
[Table 7] 

Analysis of adaptive mutation of JFH-1 replicon in HepG2 ceils 



Cione Mutation site Mutation 

Nucleotide No. Amino acid No. Nucleotide Amino acid 



HepIHl 


6826 


2714 


C=^A 




HepIH3 


6887 


2734 


C=>A 


T=>N 


HepIHS 


6887 




U=^A 


None 


HepIH8 


6580 


2632 


U=^A 


S^>T 




7159 


2825 


U^>C 


Y^>H 


HepIH9 


3342 




A_>G 


None 




3594 




C=>A 


None 




7230 


2848 


U_>A 


N=>K 


HepIHIO 


5052 




U=>C 


None 




6943 


2753 


C^A 


P^>T 


HepIHl 2 


None 








HepIH13 


4302 




C_>U 


None 




5687 


2334 


G=>A 


G=^D 




6110 


2475 


A=>G 


Y_>C 



[0127] 

As shown in Table 7, in the case of HepG2 cells, a total of 13 nucleotide 
mutations were observed in 8 cell clones, and 8 of these mutations were 
nonsynonymous substitutions that cause amino acid mutations. Types of these 
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mutations are shown in Table 8. On the other hand, in the case of HeLa cells, a 
total of 7 nucleotide mutations were observed in 3 cell clones, and 5 of these 
mutations were nonsynonymous substitutions that cause amino acid mutations. 
Types of these mutations are shown in Table 8. 
[0128] 
[Table 8] 

Analysis of adaptive mutation of JFH-1 replicon in HeLa cells 



Clone Mutation site Mutation 

Nucleotide No. Amino acid No. Nucleotide Amino acid 



HeLaHl 


None 








HeLaIH2 


5550 


2272 


U^>C 


s=>? 




6252 




A=>G 


None 




7182 




U=>C 


None 




7217 


2844 


A=>G 


H=>R 


HeLalHS 


3643 


1653 


A=>G 


M=>V 




5851 


2389 


G=>A 


A=>T 




5914 


2410 


G: >A 


E=>K 



[0129] 

In Tables 7 and 8, "HepIH No." represents clone numbers of replicon- 
replicating cell clones that have replicon RNA and have been cloned using HepG2 
cells. "Nucleotide No." shows the corresponding nucleotide number in the 
nucleotide sequence (SEQ ID NO: I) of replicon RNA rSGREP-JFHl. "Amino 
acid No." shows the corresponding amino acid number in the amino acid sequence 
(SEQ ID NO: 4) encoded by the JFH-1 clone. The types of nucleotides and 
amino acids at mutation sites are described according to their general notations. 
As shown in Table 7, for example, in clone HepIHl, a nucleotide corresponding to 
nucleotide No. 6826 of SEQ ID NO: on the replicon RNA mutated from C to A, so 
that an amino acid corresponding to amino acid No. 2714 of SEQ ID NO: mutated 
from Q to E. Similarly, in Table 8, "HeLalH No." represents numbers of 
replicon-replicating cell clones that have replicon RNA and have been cloned 
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using HeLa cells. 
[0130] 

In addition, when Northern blot analysis was conducted for clones having 
no nucleotide mutations at all that cause amino acid mutations, it was shown that 
the quantity of replicon RNA replicated by the clones was lower than that of a cell 
clone replicating replicon RNA having a nucleotide mutation that causes an amino 
acid mutation. Hence, it was concluded that the nucleotide mutation in replicon 
RNA inducing an amino acid mutation was an adaptive mutation for increasing the 
replication efficiency of replicon RNA in cells. 
[Industrial Applicability] 
[0131] 

The replicon-replicating cells according to the present invention can be 
utilized as a culture system for the continuous production of HCV genotype 2a- 
derived RNA and HCV protein. Moreover, the replicon-replicating cells 
according to the present invention are useful as a test system for screening for 
various substances affecting the replication of HCV and/or the translation into 
HCV protein. 

[Brief Description of Drawings] 
[0132] 

[Fig, 1] Fig. 1 is a schematic view showing procedures for constructing a 
template DNA for preparing the HCV- RNA replicon according to the present 
invention. The upper section of Fig. 1 shows the structure of the region within 
pJFHl or pJCHl, with the viral genome inserted into it. The lower section of 
Fig. 1 shows the structure of the region within plasmid DNA pSGREP-JFHl or 
pSGREP-JCHl, with the viral genome inserted into it, that had been constructed 
by substituting a part of viral genome-inserted region of pJFHl or pJCHl with a 
DNA fragment containing a neomycin resistance gene and EMCV IRES. Symbols 
in Fig. 1 are as described below. T7, T7 RNA promoter; G, dGTP that was 
inserted upstream of the 5' end of the inserted DNA derived from JFH-1 or JCH-1 
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and downstream of the 3' end of T7 RNA promoter sequence; 5' NTR, 5' 
untranslated region; Core, core protein; and 3' NTR, 3' untranslated region. El 
and E2 represent envelope proteins. NS2, NS3, NS4A, NS4B, NS5A and NS5B 
represent non-structural proteins. Age I, Cla I and Xba I represent cleavage sites 
of restriction enzymes Age I, Cla I and Xba I, respectively. GDD, the position of 
amino acid motif GDD corresponding to the active center of NS5B protein; neo, 
neomycin resistance gene; and EMCV IRES, internal ribosome entry site of 
encephalomyocarditis virus (EMCV IRES). 

[Fig. 2A] Fig. 2A shows the nucleotide sequence of rSGREP-JFHl . 

[Fig. 2B] Fig. 2B shows the nucleotide sequence of rSGREP-JFHl . 

[Fig. 2C] Fig. 2C shows the nucleotide sequence of rSGREP-JFHl . 

[Fig. 2D] Fig. 2D shows the nucleotide sequence of rSGREP-JFHl. 

[Fig. 2E] Fig. 2E shows the nucleotide sequence of rSGREP-JFHl . 

[Fig. 2F] Fig, 2F shows the nucleotide sequence of rSGREP-JFHl. 

[Fig, 3A] Fig. 3A shows the nucleotide sequence of rSGREP-JCHl. 

[Fig. 3B] Pig. 3B shows the nucleotide sequence of rSGREP-JCHl. 

[Fig. 3C] Fig. 3C shows the nucleotide sequence of rSGREP-JCHl . 

[Fig. 3D] Fig. 3D shows the nucleotide sequence of rSGREP-JCHl . 

[Fig. 3E] Fig. 3E shows the nucleotide sequence of rSGREP-JCHl . 

[Fig. 3F] Fig. 3F shows the nucleotide sequence of rSGREP-JCHl . 

[Fig. 4] Fig. 4 shows photographs showing the colony formation of Huh7 
cells to which rSGREP-JFHl, rSGREP-JFH 1/GND and rSGREP-JFHl/dGDD was 
transfected, respectively. The amount of each of three transfected RNAs in the 
upper section was 100 ng and that of three transfected RNAs in the lower section 
was 300 ng. 

[Fig. 5] Fig. 5 shows photographs showing colony formation of Huh7 cells 
to which rSGREP-JFHl and rSGREP-JCHl respectively had been transfected 
when the concentration of G418 was 0.5 mg/ml of the medium. The amount of 
each of these RNAs transfected was 100 ng. 
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[Fig. 6] Fig. 6 shows photographs showing the effect of Mung Bean 
Nuclease treatment conducted on the colony- forming ability of the tiansfected 
cells. The amount of rSGREP-JFHl RNA tiansfected was 100 ng for both cases. 
The concentration of G418 was 1,0 mg/ml in both media. 

[Fig. 7] Fig. 7 shows photographs showing colony formation when total 
cellular RNA derived from the replicon-replicating cell clone, which had been 
established by transfection of rSGREP-JFHl, was retransfected to another Huh7 
cells. The photograph on the left shows that the formation of 96 colonies was 
observed as a result, when using the total cellular RNA derived from the replicon- 
replicating cell clone No. 6. The photograph on the right shows that the 
formation of 77 colonies was observed as a result, when using the total cellular 
RNA derived from the pool clones. In both cases, RNA was retransfected in an 
amount containing IxlO 7 copies of the replicon RNA. 

[Fig, 8] Fig. 8 shows photographs showing the results of detecting by the 
Northern blot method using an rSGREP-JFHl -specific probe for the total RNA 
derived from a cell clone that had been obtained by retransfecting the total cellular 
RNA (derived from the replicon-replicating cell clone established by transfection 
of rSGREP-JFHl) into another Huh7 cells. Explanation of the lanes is as 
follows. 10 s represents sample prepared by adding 10 8 copies of the replicon 
RNA synthesized in vitro to total RNA extracted from Huh7 cells. 10 7 represents 
sample prepared by adding 10 7 copies of the replicon RNA synthesized in vitro to 
total RNA extracted from Huh7 cells. Huh7, total RNA extracted from 
untransfected Huh7 cells; pool clone, total RNA extracted from the pool clones; 
and 1-11, total RNA extracted from each of cell clones Nos. 1 to 11. "Replicon 
RNA" represents the electrophoresed position of a molecular weight marker 
indicating the size of rSGREP-JFHl, "28S" represents the same of a ribosomal 
RNA marker indicating the size of molecular weight of 4.5 kb, and "18S" 
represents the same of a ribosomal RNA marker indicating the size of molecular 
weight of 1.9 kb. 
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[Fig. 9] Fig. 9 shows photographs showing the presence or the absence of 
the incorporation of a neomycin resistance gene into the genomic DNA of a host 
cell in the cell clone to which rSGREP-JFHl- or rSGREP-JCHl -derived replicated 
replicon RNA was retransfected. Explanation of the lanes in the photograph on 
the left is as follows. M, DNA molecular weight marker; 1-8, rSGREP-JFHl- 
derived cell clones Nos. 1 to 8; N, untransfected Huh7 cells; and P, positive 
control (PCR amplification product of the neomycin resistance gene). 
Furthermore, explanation of the lanes in the photograph on the right is as follows. 
M, DNA molecular weight marker; and 1-6, rSGREP-JCHl -derived cell clones 
Nos. I to 6. 

[Fig. 10] Fig. 10 shows photographs showing the results of detecting NS3 
protein expressed in the cell clone that was retransfected with rSGREP-JFHl - or 
rSGREP-JCHl -derived replicated replicon RNA. Lanes 1 to 8 of the photograph 
on the left represent rSGREP-JFHl-derived ceil clones Nos. 1 to 8. Lanes 1-6 of 
the photograph on the right represent rSGREP-JCHl -derived cell clones Nos. 1 to 
6. Lane P of the photograph on the right represents NS3 protein (positive 
control) and N represents protein extracted from untransfected Huh7 cells 
(negative control). 

[Fig. 11] Fig. 11 shows the positions of nucleotide mutations in replicon 
RNAs obtained from 21 cell clones that were established through the re- 
transfection of rSGREP-JFHl -derived replicated replicon RNA into Huh7 cells. 
Mutation positions are indicated using bar lines shown with nucleotide numbers 
listed in Table 4. A thick bar line denotes nonsynonymous substitution and a thin 
bar line denotes synonymous substitution. 

[Fig. 12] Fig. 12 shows photographs showing the results of transfection 
with rSGREP-JFHl using 1) HepG2 cells; 2) IMY-N9 cells; 3) 293 cells; or 4) 
HeLa cells. 

[Fig. 13] Fig. 13 shows photographs showing the results of performing 
Northern blotting for repiicon-replicating cell clones. 
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[Fig. 14] Fig. 14 shows photographs showing the results of electrophoresis 
performed for confirming the incorporation of the neomycin resistance gene into 
genomic DNA. 

[Fig. 15] Fig. 15 shows photographs showing the results of analyzing by 
the Western blot method proteins derived from the replicon-replicating cell clones. 
[Sequence Listing Free Text] 
[0133] 

SEQ ID NO: 1. Explanation of artificial sequence: replicon 

SEQ ID NO: 2. Explanation of artificial sequence: replicon 

SEQ ID NO: 7. Explanation of artificial sequence: replicon 

SEQ ID NOS: 8 to 12, Explanation of artificial sequences: synthetic RNA 

SEQ ID NOS: 13 to 41. Explanation of artificial sequences: synthetic DNA 

[Sequence Listing] 

SEQUENCE LISTING 

<110> Toray Industries Inc. 

Tokyo Metropolitan Organization for Medical Research 
Johannes Qutenberg-Universitaet Mainz 

<120> Establishment of the genotype 2a Hepatitis C virus subgenomic 
replicon 

<130> P03-08S2 



<150> JP 2003-148242 
<151> 2003-05-26 

<160> 41 
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<170> Patentln Ver . 2.1 



<210> 1 

<211> 8024 

<212> RNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: replicon 



<220> 

<223> Inventor: Wakita, Takaji 
inventor: Kato, Takanobu 
Inventor: Date, Tomoka 



<400> 1 

accugccccu aauaggggcg acacuccgcc augaaucacu ccccugugag gaacuacugu 60 

cuucacgcag aaagcgcoua gccauggcgu uaguaugagu gucguacagc cuccaggccc 120 

cccccucccg ggagagccau aguggucugc ggaaccggug aguacaccgg aauugccggg 180 

aagacugggu ccuuucuugg auaaacccac ucuaugcccg gccauuuggg cgugcccccg 24 0 

caagacugcu agccgaguag cguuggguug cgaaaggccu ugugguacug ccugauaggg 3 00 

cgcuugcgag ugccccggga ggucucguag accgugcacc augagcacaa auccuaaacc 3 60 

ucaaagaaaa accaaaagaa acaccaaccg ucgcccaaug auugaacaag auggauugca 42 0 

cgcagguucu ccggccgcuu ggguggagag gcuauucggc uaugacuggg cacaacagac 480 

aaucggcugc ucugaugccg ccguguuccg gcugucagcg caggggcgcc cgguucuuuu 54 0 

ugucaagacc gaccuguccg gugcccugaa ugaacugcag gacgaggcag cgcggcuauc 600 

guggcuggcc acgacgggcg uuccuugcgc agcugugcuc gacguuguca cugaagcggg 660 

aagggacugg cugcuauugg gcgaagugcc ggggcaggau cuccugucau cucaccuugc 720 

uccugccgag aaaguaucca ucauggcuga ugcaaugcgg cggcugcaua cgcuugaucc 780 

ggcuaccugc ccauucgacc accaagcgaa acaucgcauc gagcgagcac guacucggau 84 0 

ggaagccggu cuugucgauc aggaugaucu ggacgaagag caucaggggc ucgcgccagc 900 
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cgaacuguuc gccaggcuca aggcgcgcau gcccgacggc gaggaucucg ucgugaccca 960 
uggcgaugcc ugcuugccga auaucauggu ggaaaauggc cgcuuuucug gauucaucga 1020 
cuguggccgg cugggugugg cggaccgcua ucaggacaua gcguuggcua cccgugauau 1080 
ugcugaagag cuuggcggcg aaugggcuga ccgcuuccuc gugcuuuacg guaucgccgc 114 0 
ucccgauucg cagcgcaucg ccuucuaucg ccuucuugac gaguucuucu gaguuuaaac 12 0 0 
ccucucccuc cccccccccu aacguuacug gccgaagccg cuuggaauaa ggccggugug 1260 
cguuugucua uauguuatmu uccaccauau ugccgucuuu uggcaaugug agggcccgga 13 2 0 
aaccuggccc ugucuucuug acgagcauuc cuaggggucu uuccccucuc gccaaaggaa 1380 
ugcaaggucu guugaauguc gugaaggaag caguuccucu ggaagcuucu ugaagacaaa 1440 
caacgucugu agcgacccuu ugcaggcagc ggaacccccc accuggcgac aggugccucu 1500 
gcggccaaaa gccacgugua uaagauacac cugcaaaggc ggcacaaccc cagugccacg 1560 
uugugaguug gauaguugug gaaagaguca aauggcucuc cucaagcgua uucaacaagg 162 0 
ggcugaagga ugcccagaag guaccccauu guaugggauc ugaucugggg ccucggugca 1680 
caugcuuuac auguguuuag ucgagguuaa aaaaacgucu aggccccccg aaccacgggg 1740 
acgugguuuu ccuuugaaaa acacgaugau accauggcuc ccaucacugc uuaugcccag 1800 
caaacacgag gccuccuggg cgccauagug gugaguauga oggggcguga caggacagaa 1860 
caggccgggg aaguccaaau ccuguccaca gucucucagu ccuuccucgg aacaaccauc 1920 
ucggggguuu uguggacugu uuaccacgga gcuggcaaca agacucuagc cggcuuacgg 1980 
gguccgguca cgcagaugua cucgagugcu gagggggacu ugguaggcug gcccagcccc 2 04 0 
ccugggacca agucuuugga gccgugcaag uguggagccg ucgaccuaua ucuggucacg 2100 
cggaacgcug augucauccc ggcucggaga cgcggggaca agcggggagc auugcucucc 2160 
ccgagaccca uuucgaccuu gaaggggucc ucgggggggc cggugcucug cccuaggggc 2220 
cacgucguug ggcucuuccg agcagcugug ugcucucggg gcguggccaa auccaucgau 2280 
uucauccccg uugagacacu cgacguuguu acaaggucuc ccacuuucag ugacaacagc 2340 
acgccaccgg cugugcccca gaccuaucag gucggguacu ugcaugcucc aacuggcagu 24 0 0 
ggaaagagca ccaagguccc ugucgcguau gccgcccagg gguacaaagu acuagugcuu 2460 
aaccccucgg uagcugccac ccugggguuu ggggcguacc uauccaaggc acauggcauc 2520 
aaucccaaca uuaggacugg agucaggacc gugaugaccg gggaggccau cacguacucc 2 58 0 
acauauggca aauuucucgc cgaugggggc ugcgcuagcg gcgccuauga caucaucaua 2 64 0 
ugcgaugaau gccacgcugu ggaugcuacc uccauucucg gcaucggaac gguccuugau 2700 
caagcagaga cagccggggu cagacuaacu gugcuggcua cggccacacc ccccggguca 2760 
gugacaaccc cccaucccga uauagaagag guaggccucg ggcgggaggg ugagaucccc 2820 
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uucuauggga gggcgauucc ccuauccugc aucaagggag ggagacaccu gauuuucugc 28 8 0 
cacucaaaga aaaaguguga cgagcucgcg gcggcccuuc ggggcauggg cuugaaugcc 2 94 0 
guggcauacu auagaggguu ggacgucucc auaauaccag cucagggaga uguggugguc 3 0 00 
gucgccaccg acgcccucau gacgggguac acuggagacu uugacuccgu gaucgacugc 3 060 
aauguagcgg ucacccaagc ugucgacuuc agccuggacc ccaccuucac uauaaccaca 3120 
cagacugucc cacaagacgc ugucucacgc agucagcgcc gcgggcgcac agguagagga 318 0 
agacagggca cuvtauaggua uguuuccacu ggugaacgag ccucaggaau guuugacagu 324 0 
guagugcuuu gugagugcua cgacgcaggg gcugcguggu acgaucucac accagcggag 3300 
accaccguca ggcuuagagc guauuucaac acgcccggcc uacccgugug ucaagaccau 3360 
cuugaauuuu gggaggcagu uuucaccggc cucacacaca uagacgccca cuuccucucc 3420 
caaacaaagc aagcggggga gaacuucgcg uaccuaguag ccuaccaagc uacggugugc 3480 
gccagagcca aggccccucc cccguccugg gacgccaugu ggaagugccu ggcccgacuc 3540 
aagccuacgc uugcgggccc cacaccucuc cuguaccguu ugggcccuau uaccaaugag 3600 
gucacccuca cacacccugg gacgaaguac aucgccaeau gcaugcaagc ugaccuugag 3660 
gucaugacca gcacgugggu ccuagcugga ggaguccugg cagccgucgc cgcauauugc 3720 
cuggcgacug gaugcguuuc caucaucggc cgcuugcacg ucaaccagcg agucgucguu 3 780 
gcgccggaua aggagguccu guaugaggcu uuugaugaga uggaggaaug cgccucuagg 3 840 
gcggcucuca ucgaagaggg gcagcggaua gccgagaugu ugaaguccaa gauccaaggc 3 900 
uugcugcagc aggccucuaa gcaggcccag gacauacaac ccgcuaugca ggcuucaugg 3960 
cccaaagugg aacaauuuug ggccagacac auguggaacu ucauuagcgg cauccaauac 4020 
cucgcaggau ugucaacacu gccagggaac cccgcggugg cuuccaugau ggcauucagu 4080 
gccgcccuca ccaguccguu gucgaccagu accaccaucc uucucaacau caugggaggc 4140 
ugguuagcgu cccagaucgc accacccgcg ggggccaccg gcuuugucgu caguggccug 4200 
gugggggcug ccgugggcag cauaggccug gguaaggugc ugguggacau ccuggcagga 42 6 0 
uauggugcgg gcauuucggg ggcccucguc gcauucaaga ucaugucugg cgagaagccc 43 2 0 
ucuauggaag augucaucaa ucuacugccu gggauccugu cuccgggagc ccugguggug 4380 
ggggucaucu gcgcggccau ucugcgccgc cacgugggac cgggggaggg cgcgguccaa 4440 
uggaugaaca ggcuuauugc cuuugcuucc agaggaaacc acgucgcccc uacucacuac 4500 
gvagacggagu cggaugcguc gcagcgugug acccaacuac uuggcucucu uacuauaacc 4560 
agccuacuca gaagacucca caauuggaua acugaggacu gccccauccc augcuccgga 4620 
uccuggcxicc gcgacgugug ggacuggguu ugcaccaucu ugacagacuu caaaaauugg 4680 
cugaccucua aauuguuccc caagcugccc ggccuccccu ucaucucuug ucaaaagggg 4740 
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uacaagggug ugugggccgg cacuggcauc augaccacgc gcugcccuug cggcgccaac 4800 
aucucuggca auguccgccu gggcucuaug aggaucacag ggccuaaaac cugcaugaac 4860 
accuggcagg ggaccuuucc uaucaauugc uacacggagg gccagugcgc gccgaaaccc 4 92 0 
cccacgaacu acaagaccgc caucuggagg guggcggccu cggaguacgc ggaggugacg 4980 
cagcaugggu cguacuccua uguaacagga cugaccacug acaauougaa aauuccuugc 5040 
caacuacouu cuccagaguu uuucuccugg guggacggug ugcagaucca uagguuugca 5100 
cccacaccaa agccguuuuu ccgggaugag gucucguucu gcguugggcu uaauuccuau 5160 
gcugucgggu cccagcuucc cugugaaocu gagcccgacg cagacguauu gagguccaug 52 2 0 
cxiaacagauc cgccccacau cacggcggag acugcggcgc ggcgcuuggc acggggauca 5280 
ccuccaucug aggcgagcuc cucagugagc cagcuaucag caccgucgcu gcgggccacc 5340 
ugcaccaccc acagcaacac cuaugacgug gacauggucg augccaaccu gcucauggag 5400 
ggcggugugg cucagacaga gccugagucc agggugcccg uucuggacuu ucucgagcca 54 60 
auggccgagg aagagagcga ccmigagccc ucaauaccau cggagugcau gcuccccagg 5520 
agcggguuuc cacgggccuu accggcuugg gcacggccug acuacaaccc gccgcucgug 5580 
gaaucgugga ggaggccaga uuaccaaccg cccaccgmig cugguugugc ucuccccccc 5640 
cccaagaagg ccccgacgcc ucccccaagg agacgccgga cagugggucu gagcgagagc 5700 
accauaucag aagcccucca gcaacuggcc aucaagaccu uuggccagcc ccccucgagc 5760 
ggugaugcag gcucguccac gggggcgggc gccgccgaau ccggcggucc gacguccccu 5820 
ggugagccgg cccccucaga gacagguucc gccuccucua ugcccccccu cgagggggag 5880 
ccuggagauc cggaccugga gucugaucag guagagcuuc aaccuccccc ccaggggggg 5940 
gggguagcuc ccgguucggg cucggggucu uggucuacuu gcuccgagga ggacgauacc 6000 
accgugugcu gcuccauguc auacuccugg accggggcuc uaauaacucc cuguagcccc 6060 
gaagaggaaa aguugccaau caacccuuug aguaacucgc uguugcgaua ccauaacaag 612 0 
guguacugua caacaucaaa gagcgccuca cagagggcua aaaagguaac uuuugacagg 6180 
acgcaagugc ucgacgccca uuaugacuca gucuuaaagg acaucaagcu agcggcuucc 624 0 
aaggucagcg caaggcuccu oaccuuggag gaggcgugcc aguugacucc accccauucu 6300 
gcaagaucca aguauggauu cggggccaag gagguccgca gcuuguccgg gagggccguu 6360 
aaccacauca aguccgugug gaaggaccuc cuggaagacc cacaaacacc aauucccaca 64 2 0 
accaucaugg ccaaaaauga gguguucugc guggaccccg ccaagggggg uaagaaacca 64 8 0 
gcucgccuca ucguuuaccc ugaccucggc guccgggucu gcgagaaaau ggcccucuau 6 54 0 
gacauuacac aaaagcuucc ucaggcggua augggagcuu ccuauggcuu ccaguacucc 6 60 0 
ccugcccaac ggguggagua ucucuugaaa gcaugggcgg aaaagaagga ccccaugggu 6660 
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uuuucguaug auacccgaug cuucgacuca accgucacug agagagacau caggaccgag 6720 
gaguccauau accaggccug cucccugccc gaggaggccc gcacugccau acacucgcug 6780 
acugagagac uuuacguagg agggcccaug uucaacagca agggucaaac cugcgguuac 6840 
agacguugcc gcgccagcgg ggugcuaacc acuagcaugg guaacaccau cacaugcuau 69 0 0 
gugaaagccc uagcggccug caaggcugcg gggauaguug cgcccacaau gcugguaugc 69 6 0 
ggcgaugacc uaguagucau cucagaaagc caggggacug aggaggacga gcggaaccug 7020 
agagccuuca cggaggccau gaccagguac ucugccccuc cuggugaucc ccccagaccg 7080 
gaauaugacc uggagcuaau aacauccugu uccucaaaug ugucuguggc guugggcccg 7140 
cggggccgcc gcagauacua ccugaccaga gacccaacca cuccacucgc ccgggcugcc 72 00 
ugggaaacag uuagacacuc cccuaucaau ucaoiggcugg gaaacaucau ccaguaugou 7260 
ccaaccauau ggguucgcau gguccuaaug acacacuucu ucuccauucu caugguccaa 7320 
gacacccugg accagaaccu caacuuugag auguauggau caguauacuc cgugaauccu 73 80 
uuggaccuuc cagccauaau ugagagguua cacgggcuug acgccuuuuc uaugcacaca 7440 
uacucucacc acgaacugac gcggguggcu ucagcccuca gaaaacuugg ggcgccaccc 7500 
cucagggugu ggaagagucg ggcucgcgca gucagggcgu cccucaucuc ccguggaggg 7560 
aaagcggccg uuugcggccg auaucucuuc aauugggcgg ugaagaccaa gcucaaacuc 7620 
acuccauugc cggaggcgcg ccuacuggac uuauccaguu gguucaccgu cggcgccggc 7680 
gggggcgaca uuuuucacag cgugucgcgc gcccgacccc gcucauuacu cuucggccua 774 0 
cuccuacuuu ucguaggggu aggccucuuc cuacuccccg cucgguagag cggcacacac 7800 
uagguacacu ccauagcuaa cuguuccuuu uuuuuuuuuu uuuuuuuuuu uuuuuuuuuu 7860 
uuuuuuuuuu cuuuuuuuuu uuuuucccuc uuucuucccu ucucaucuua uucuacuuuc 7920 
uuucuuggug gcuccaucuu agcccuaguc acggcuagcu gugaaagguc cgugagccgc 7980 
augacugcag agagugccgu aacuggucuc ucugcagauc augu 8 02 4 



<210> 2 

<2H> 8024 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: replicon 
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<400> 2 

acccgccccu aauaggggcg acacuacgcc augaaucacu ccccugugag gaacuacugu 60 
cuucacgcag aaagcgucua gccauggcgu uaguaugagu gucguacagc cuccaggccc 120 
cccccucccg ggagagccau aguggucugc ggaaccggug aguacaccgg aauugccggg 180 
aagacugggu ccuuucuugg auaaacccac ucuaugcccg gccauuuggg cgugcccccg 240 
caagacugcu agccgaguag cguuggguug cgaaaggccu ugugguacug ccugauaggg 300 
ugcuugcgag ugccccggga ggucucguag accgugcacc augagcacaa aucccaaacc 360 
ucaaagaaaa accaaaagaa acacuaaccg ucgcccaaug auugaacaag auggauugca 42 0 
cgcagguucu ccggccgcuu ggguggagag gcuauucggc uaugacuggg cacaacagac 480 
aaucggcugc ucugaugccg ccguguuccg gcugucagcg caggggcgcc cgguucuuuu 54 0 
ugucaagacc gaccuguccg gugcccugaa ugaacugcag gacgaggcag cgcggcuauc 600 
guggcuggcc acgacgggcg uuccuugcgc agcugugcuc gacguuguca cugaagcggg 66 0 
aagggacugg cugcuauugg gcgaagugcc ggggcaggau cuccugucau cucaccuugc 720 
uccugccgag aaaguaucca ucauggcuga ugcaaugcgg cggcugcaua cgcuugaucc 780 
ggcuaccugc ccauucgacc accaagcgaa acaucgcauc gagcgagcac guacucggau 84 0 
ggaagccggu cuugucgauc aggaugaucu ggacgaagag caucaggggc ucgcgccagc 90 0 
cgaacuguuc gccaggcuca aggcgcgcau gcccgacggc gaggaucucg ucgugaccca 960 
uggcgaugcc ugcuugccga auaueauggu ggaaaauggc cgcuuuucug gauucaucga 1020 
cuguggccgg cugggugugg cggaccgcua ucaggacaua gcguuggcua cccgugauau 1080 
ugcugaagag cuuggcggcg aaugggcuga ccgcuuccuc gugcuuuacg guaucgccgc 114 0 
ucccgauucg cagcgcaucg ccuucuaucg ccuucuugac gaguucuucu gaguuuaaac 12 00 
ccucucccuc cccccccccu aacguuacug gccgaagccg cuuggaauaa ggccggugug 12 60 
cguuugucua ua\iguuauuu uccaccauau ugccgucuuu uggcaaugug agggcccgga 1320 
aaccuggccc ugucuucuug acgagcauuc cuaggggucu uuccccucuc gccaaaggaa 13 80 
ugcaaggucu guugaauguc gugaaggaag caguuccucu ggaagcuucu ugaagacaaa 1440 
caacgucugu agcgacccuu ugcaggcagc ggaacccccc accuggcgac aggugccucu 1500 
gcggccaaaa gccacgugua uaagauacac cugcaaaggc ggcacaaccc cagugccacg 1560 
uugugaguug gauaguugug gaaagaguca aauggcucuc cucaagcgua uucaacaagg 1620 
ggcugaagga ugcccagaag guaccccauu guaugggauc ugaucugggg ccucggugca 1680 
caugcuuuac auguguuuag ucgagguuaa aaaaacgucu aggccccccg aaccacgggg 1740 
acgugguuuu ccuuugaaaa acacgauaau accauggccc ccaucaccgc uuacgcccag 1800 
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cagacacgag gucucuuggg cucuauagug gugagcauga cggggcguga caagacagaa 1860 
caggccgggg agguccaagu ccuguccaca gucacucagu ccuuccucgg aacauccauu 192 0 
ucgggggucu uauggacugu uuaccacgga gcuggcaaca agacacuagc cggcucgcgg 1980 
ggcccgguca cgcagaugua cucgagcgcc gagggggacu uggucgggug gcccagcccu 204 0 
ccugggacca aaucuuugga gccguguacg uguggagcgg ucgaccugua uuuggucacg 2100 
cggaacgcug augucauccc ggcucgaaga cgcggggaca agcggggagc gcugcucucc 2160 
ccgagacccc uuucgaccuu gaaggggucc ucggggggac cugugcuuug cccuaggggc 2220 
cacgcugucg gaaucuuccg ggcagcugug ugcucucggg guguggcuaa guccauagau 2280 
uucauccccg uugagacgcu cgacaucguc acgcggucuc ccaccuuuag ugacaacagc 2340 
acaccaccag cugugcccca gaccuaucag gugggguacu ugcacgcccc cacuggcagu 2400 
ggaaaaagca ccaagguccc cgucgcguac gccgcccagg gguauaaagu gcuggugcuc 2460 
aaucccucgg uggcugccac ccugggauuu ggggcguacu uguccaaggc acauggcauc 2520 
aaccccaaca uuaggacugg agucagaacu gugacgaccg gggagcccau uacauacucc 2S80 
acguauggua aauuccucgc cgaugggggc ugcgcaggcg gcgccuauga caucaucaua 2 64 0 
ugcgaugaau gccacucugu ggaugcuacc acuauucucg gcaucgggac aguccuugac 2700 
caagcagaga cagccggggu caggcuaacu guacuggcca cggccacgcc ccccgggucg 2760 
gugacaaccc cccaucccaa uauagaggag guagcccucg gacaggaggg ugagaucccc 2820 
uucuauggga gggcguuucc ccugucuuac aucaagggag ggaggcacuu gauuuucugc 2880 
cacucaaaga aaaaguguga cgagcucgca acggcccuuc ggggcauggg cuugaacgcu 2 94 0 
guggcauauu acagaggguu ggacgucucc auaauaccaa cucaaggaga uguggugguc 3 000 
guugccaccg acgcccucau gacgggguau acuggagacu uugacuccgu gaucgacugc 3060 
aacguagcgg ucacccaggc cguagacuuc agccuggacc ccaccuucac uauaaccaca 3120 
cagacugucc cgcaagacgc ugucucacgu agucagcgcc gagggcgcac ggguagagga 3180 
agacugggca uuuauaggua uguuuccacu ggugagcgag ccucaggaau guuugacagu 3240 
guaguacucu gugagugcua cgacgcagga gcugcuuggu augagcucuc accaguggag 33 00 
acgaccguca ggcucagggc guauuucaac acgccuggcu ugccugugug ccaggaccac 33 60 
cuugaguuuu gggaggcagu uuucaccggc cucacacaca uagacgcuca uuuccuuucc 3420 
cagacaaagc agucggggga aaauuucgca uacuuaguag ccuaucaggc cacagugugc 34 80 
gccagggcca aagcgccccc cccguccugg gacgucaugu ggaagugcuu gacucgacuc 3540 
aagcccacgc uugugggccc uacaccucuc cuguaccguu ugggcucugu uaccaacgag 3600 
gucacccuua cacaccccgu gacaaaauac aucgccacau gcaugcaagc ugaccucgag 3660 
gucaugacca gcacgugggu ccuggcuggg ggagucuuag cagccgucgc cgcguauugc 3720 
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uuagcgaccg gguguguuvsc caucauuggc cguuuacaca ucaaccagcg agcugucguc 3780 
gcuccggaca aggagguccu cuaugaggcu uuugaugaga uggaggaaug ugccuccaga 3 84 0 
gcggcucucc uugaagaggg gcagcggaua gccgagaugc ugaaguccaa gauccaaggc 3 900 
uuauugcagc aagccucuaa acaggcccag gacauacaac ccgcugugca agcuucgugg 3 960 
cccaagaugg agcaauucug ggccaaacau auguggaacu ucauaagcgg cauucaguac 4 020 
cucgcaggac ugucaacacu gccagggaac ccugcugugg cuuccaugau ggcauucagc 4Q80 
gccgcccuca ccaguccguu gucaacuagc accaccaucc uucuuaacau ucuggggggc 4 140 
uggcuggcgu cccaaauugc gccacccgcg ggggccacug gcuuuguugu caguggccug 4200 
gugggagcug cuguuggcag cauaggcuug gguaaagugc ugguggacau ccuggcaggg 4260 
uauggugcgg gcaumicggg ggcccucguc gcguuuaaga ucaugucugg cgagaagccc 4320 
uccauggagg augucaucaa cuugcugccu gggauucugu cuccaggugc ucugguggug 4380 
ggagucaucu gcgcggccau ucugcgccgc caugugggac cgggggaagg cgcgguccaa 4440 
uggaugaaca ggcuuaucgc cuucgcuucc agaggaaacc acgucgcccc uacucacuac 4500 
gugacggagu cggaugcguc gcagcguguc aocoaacugo uuggcucucu cacuauaacu 4560 
agucuacuca ggagacuuca caacuggauc acugaggauu gccccauccc augcgccggc 4 62 0 
ucguggcucc gcgaugugug ggacuggguc uguaccaucc uaacagacuu uaagaacugg 4 680 
cugaccucca agcuguuccc aaagaugccu ggccuccccu uuaucucuug ccaaaagggg 474 0 
uacaagggcg ugugggccgg caouggcauc augaccacac gaugccccug cggcgccaac 4800 
aucucuggca acguccgcuu gggcucuaug agaaucacag gacccaaaac cugcaugaac 4860 
accuggcagg ggaccuuucc uaucaauugu uauacagaag gccagugcuu gccgaaaccc 4920 
gcguuaaacu ucaagaccgc caucuggaga guggcggccu cagaguacgc ggaagugacg 4980 
cagcacggau cauaugccua uauaacaggg cugaccacug acaacuuaaa agucccuugc 5040 
caacuccccu cuccagaguu uuucucuugg guggacggag uacaaaucoa uagguccgcc 5100 
cccacaccaa agccguuuuu ccgggaugag gucucguuca gcguugggcu caauucauuu 5160 
gucgucgggu cucagcuucc cugugacccu gagcccgaca cugagguagu gauguccaug 5220 
cuaacagacc caucccauau cacggcggag gcugcagcgc ggcguuuagc gcggggguca 5280 
cccccaucug aggcaagcuc cucagcgagc cagcugucgg cgccaucgcu gcgagccacc 5340 
ugcaccaccc acgguaggac cuaugaugug gacauggugg augccaaccu guucaugggg 5400 
ggcggcguga uucggauaga gucugagucc aaaguggucg uucuggacuc ccucgacuca 54 6 0 
augaccgagg aagagggcga ccuugagccu ucaguaccau cggaguauau gcuccccagg 5520 
aagagguucc caccggccuu accggcuugg gcgcggccug auuacaaccc accgcuugug 5580 
gaaucgugga agaggccaga uuaccaacca cccacuguug cgggcugugc ucuccccccc 564 0 



68 



JP Application No. 2003-329 1 1 5 



cccaaaaaga ccccgacgcc uccuccaagg agacgccgga cagugggucu gagcgagagc 5700 
accauaggag augcccucca acagcuggcc aucaaguccu uuggccagcc ccccccaagc B760 
ggcgauucag gccuuuccac gggggcggac gccgccgacu ccggcgaucg gacacccccu 5820 
gacgaguvsgg cucuuucgga gacagguucu accuccucca ugcccccccu cgagggggag 58S0 
ccuggggacc cagaccugga gccugagcag guagagcuuc aaccuccucc ccaggggggg 594 0 
gaggcagcuc ccggcucgga cucggggucc uggucuacuu gcuccgagga ggaugacucc 6000 
gucgugugcu gcuccauguc auauuccugg accggggcuc uaauaacucc uuguagcccc 6060 
gaagaggaaa aguugcoaau uaacuccuug agcaacucgc uguugcgaua ccauaacaag 6120 
guauacugua cuacaucaaa gagugccuca cuaagggcua aaaagguaac uuuugauagg 6180 
augcaagugc ucgacgccua uuaugauuca gucuuaaagg acaucaagcu agcggccucc 6240 
aaggucagcg caaggcuccu caccuuagag gaggcgugcc aauugacccc accccacucu 63 00 
gcaagaucca aguauggguu uggggcuaag gagguccgca gcuuguccgg gagggccguc 63 60 
aaccacauca aguccgugug gaaggaccuc uuggaagacu cacaaacacc aauuccuaca 642 0 
accaucaugg ccaaaaauga gguguucugc guggaccccg ccaagggggg uaaaaaacca 6480 
gcucgccuua ucguuuaccc ugaccucggc gucagggucu gcgagaagau ggcccuuuau 6 54 0 
gaugucacac aaaagcuucc ucaggcggug augggggcuu cuuauggcuu ccaguacucc 6600 
cccgcucagc ggguggaguu ucucuugaag gcaugggcgg aaaagagaga cccuaugggu 6660 
uuuucguaug auacccgaug cuuugacuca accgucacug agagagacau caggacugag 6720 
gaguccauau accaggccug cuccuuaccc gaggaggccc gaacugccau acacucgcug 6780 
acugagagac ucuauguggg agggcccaug uucaacagca agggccaguc cugcggguac 6840 
aggcguugcc gcgccagcgg ggugcuuacc acuaguaugg ggaacaccau cacaugcuau 6900 
guaaaagccc uagcggcuug caaggcugcg gggauaauug cgcccacgau gcugguaugc 6960 
ggcgacgacu uggucgucau cucagaaagc caggggacug aggaggacga gcggaaccug 702 0 
agagccuuca cggaggcuau gaccagguau ucugccccuc cuggugaccc ccccagaccg 7080 
gaauaugacc uggagcuaau aacaucuugu uccucaaacg ugucuguggc acuuggccca 714 0 
cagggccgcc gcagauacua ccugaccaga gaccccacca cuucaauugc ccgggcugcc 7200 
ugggaaacag uuagacacuc cccugucaau ucauggcugg gaaacaucau ccaguacgcu 7260 
ccaaccauau ggguucgcau gguccugaug acacacuucu ucuccauucu cauggcccag 7320 
gacacccuag accagaaccu uaacuuugaa auguacggau cgguguacuc cgugaguccu 73 80 
cuggaccucc cagccauaau ugaaagguua cacgggcuug acgccuucuc ucugcacaca 7440 
uacacucccc acgaacugac gcggguggcu ucagcccuca gaaaacuugg ggcgccaccc 7500 
cucagagcgu ggaagagucg ggcgcgugca guuagggcgu cccucaucuc ccgugggggg 7560 
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agggcggccg uuugcggucg guaccucuuc aacugggcgg ugaagaccaa gcucaaacuc 7 62 0 
acuccuuugc cggaggcacg ccuccuggau uuguccaguu gguuuaccgu cggcgccggc 7 68 0 
gggggcgaca uuuaucacag cgugucgcgu gcccgacccc gccuauuacu ccuuagccua 7 74 0 
cuccuacuuu cuguaggggu aggccucuuc cuacuccccg cucgauagag cggcacacau 7800 
uagcuacacu ccauagcuaa cuguuccuxiu uuuuuuuuuu uuuuuuuuuu uuuuuuuuuu 78 6 0 
uuuuuuuuuu cuuuuuuuuu uuuuucccuc uuucuucccu ucucaucuua uucuacuuuc 792 0 
uuucuuggug goucoaucuu agcccuaguc acggcuagcu gugaaagguc cgugagccgc 798 0 
augacugcag agagugccgu aacuggucuc ucugcagauc augu 8 024 



<210> 3 
<211> 9678 
<212> DNA 

<213> Hepatitis C virus 

<220> 
<221> CDS 

<222> (341) . . (9442) 
<400> 3 

acctgcccct aataggggcg acactccgcc atgaatcact cccctgtgag gaactactgt 60 

cttcacgcag aaagcgccta gccatggcgt tagtatgagt gtcgtacagc ctccaggccc 120 

ccccctcccg ggagagccat agtggtctgc ggaaccggtg agtacaccgg aattgccggg 180 

aagactgggt cctttcttgg ataaacccac tctatgcccg gccatttggg cgtgcccccg 240 

caagactgct agccgagtag cgttgggttg cgaaaggcct tgtggtactg cctgataggg 300 

cgcttgcgag tgccccggga ggtctcgtag accgtgcacc atg age aca aat cct 3 55 
Met Ser Thr Asn Pro 
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aaa cct caa aga aaa acc aaa aga aac acc aac cgt cgc cca gaa gac 
Lys Pro Gin Arg Lys Thr Lys Arg A6n Thr Asn Arg Arg Pro Glu Asp 



gtt aag ttc ccg ggc ggc ggc cag ate gtt ggc gga gta tac ttg ttg 451 
Val Lys Phe Pro Gly Gly Gly Gin lie Val Gly Gly Val Tyr Leu Leu 



ccg cgc agg ggc ccc agg ttg ggt gtg cgc acg aca agg aaa act teg 499 
Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Thr Thr Arg Lys Thr Ser 



gag egg tec cag cca cgt ggg aga cgc cag ccc ate ccc aaa gat egg 547 
Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro lie Pro Lys Asp Arg 



cgc tec act ggc aag gee tgg gga aaa cca ggt cgc ccc tgg ccc eta 595 
Arg Ser Thr Gly Lys Ala Trp Gly Lys Pro Gly Arg Pro Trp Pro Leu 



tat ggg aat gag gga etc ggc tgg gca gga tgg etc ctg tec ccc cga 643 

Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp Leu Leu Ser Pro Arg 
90 95 100 

ggc tct cgc ccc tec tgg ggc ccc act gac ccc egg cat agg teg cgc 691 

Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro Arg His Arg Ser Arg 
105 110 115 

aac gtg ggt aaa gtc ate gac acc eta acg tgt ggc ttt gee gac etc 739 

Asn Val Gly Lys Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
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atg ggg tac ate ccc gtc gta ggc gec ccg ctt agt ggc gec gec aga 787 

Met Gly Tyr lie Pro Val Val Gly Ala Pro Leu Ser Gly Ala Ala Arg 
135 140 145 

get gtc gcg cac ggc gtg aga gtc ctg gag gac ggg gtt aat tat gca 83 5 

Ala Val Ala His Gly Val Arg Val Leu Glu Asp Gly val Asn Tyr Ala 
150 155 150 165 

aca ggg aac eta ccc ggt ttc ccc ttt tct ate ttc ttg ctg gec ctg 883 

Thr Gly Asn Leu Pro Gly Phe Pro Phe Ser lie Phe Leu Leu Ala Leu 
170 175 180 



ttg tec tgc ate acc gtt ccg gtc tct get gec cag gtg aag aat ace 931 

Leu Ser Cys lie Thr Val Pro Val Ser Ala Ala Gin Val Lys Asn Thr 
185 190 195 

agt age age tac atg gtg acc aat gac tgc tec aat gac age ate act 979 

Ser Ser Ser Tyr Met Val Thr Asn Asp Cys Ser Asn Asp Ser He Thr 

200 205 210 

tgg cag etc gag get gcg gtt etc cac gtc ccc ggg tgc gtc ccg tgc 1027 

Trp Gin Leu Glu Ala Ala Val Leu His Val Pro Gly Cys Val Pro Cys 
215 220 225 

gag aga gtg ggg aat acg tea egg tgt tgg gtg cca gtc teg cca aac 1075 

Glu Arg Val Gly Asn Thr Ser Arg Cys Trp Val Pro Val Ser Pro Asn 
230 235 240 245 

atg get gtg egg cag ccc ggt gec etc acg cag ggt ctg egg acg cac 1123 

Met Ala Val Arg Gin Pro Gly Ala Leu Thr Gin Gly Leu Arg Thr His 
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ate gat atg gtfc gtg atg tec gec acc ttc tgc tct get etc tac gtg 1171 
lie Asp Met val Val Met Ser Ala Thr Phe Cys Ser Ala Leu Tyr Val 
265 270 275 

ggg gac etc tgt ggc ggg gtg atg etc gcg gee cag gtg ttc ate gtc 1219 
Gly Aap Leu Cys Gly Gly Val Met Leu Ala Ala Gin Val Phe lie Val 
280 285 290 

teg ccg cag tac cac tgg ttt gtg caa gaa tgc aat tgc tec ate tac 1267 
Ser Pro Gin Tyr His Trp Phe Val Gin Glu Cys Asn Cys Ser lie Tyr 
295 300 305 

cct ggc acc ate act gga cac cgc atg gca tgg gac atg atg atg aac 1315 
Pro Gly Thr lie Thr Gly His Arg Met Ala Trp Asp Met Met Met Aen 
310 315 320 325 

tgg teg ccc acg gee acc atg ate ctg gcg tac gtg atg cgc gtc ccc 1363 
Trp Ser Pro Thr Ala Thr Met He Leu Ala Tyr Val Met Arg Val Pro 
330 335 340 

gag gtc ate ata gac ate gtt age ggg get cac tgg ggc gtc atg ttc 1411 
Glu Val He He Asp He Val Ser Gly Ala His Trp Gly Val Met Phe 
345 350 355 

ggc ttg gee tac ttc tct atg cag gga gcg tgg gcg aag gtc att gtc 145 9 
Gly Leu Ala Tyr Phe Ser Met Gin Gly Ala Trp Ala Lys Val He Val 
360 365 370 

ate ctt ctg ctg gee get ggg gtg gac gcg ggc acc acc acc gtt gga 1507 
He Leu Leu Leu Ala Ala Gly Val Asp Ala Gly Thr Thr Thr Val Gly 
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ggc get gtt gca cgt tec acc aac gtg att gec ggc gtg ttc age cat 1555 

Gly Ala Val Ala Arg Ser Thr Asn Val lie Ala Gly Val Phe Ser Hie 

390 395 400 405 

ggc cct cag cag aac att cag etc att aac acc aac ggc agt tgg cac 1603 

Gly Pro Gin Gin Asn lie Gin Leu rle Asn Thr Asn Gly Ser Trp His 
410 415 420 

ate aac cgt- act gee ttg aat tgc aat gac tec ttg aac acc ggc ttt 1651 

lie Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Asn Thr Gly Phe 
425 430 435 

etc gcg gec ttg ttc tac acc aac cgc ttt aac teg tea ggg tgt cca 1699 

Leu Ala Ala Leu Phe Tyr Thr Asn Arg Phe Asn Ser Ser Gly Cys Pro 
440 445 450 

ggg cgc ctg tec gec tgc cgc aac ate gag get ttc egg ata ggg tgg 1747 

Gly Arg Leu Ser Ala Cys Arg Asn lie Glu Ala Phe Arg lie Gly Trp 
455 460 465 

ggc acc eta cag tac gag gat aat gtc acc aat cca gag gat atg agg 1795 

Gly Thr Leu Gin Tyr Glu Asp Asn Val Thr Asn Pro Glu Asp Met Arg 

470 475 480 485 

ccg tac tgc tgg cac tac ccc cca aag ccg tgt ggc gta gtc ccc gcg 1843 

Pro Tyr Cys Trp His Tyr Pro Pro Lys Pro Cys Gly Val Val Pro Ala 
490 495 500 

agg tct gtg tgt ggc cca gtg tac tgt ttc acc ccc age ccg gta gta 1891 

Arg Ser Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val 



74 



JP Application No. 2003-329 



gtg ggc acg acc gac aga cgt gga gtg ccc acc tac aca tgg gga gag 1939 

Val Gly Thr Thr Asp Arg Arg Gly Val Pro Thr Tyr Thr Trp Gly Glu 
520 525 530 

aat gag aca gat gtc ttc eta ctg aac age acc cga ccg ccg cag ggc 1987 

Asn Glu Thr Asp Val Phe Leu Leu Asn Ser Thr Arg Pro Pro Gin Gly 
535 540 S45 

tea tgg ttc ggc tgc acg tgg atg aac tec act ggt ttc acc aag act 2035 

Ser Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe Thr Lys Thr 

550 555 560 565 

tgt ggc gcg cca cct tgc cgc acc aga get gac ttc aac gee age acg 208 3 

Cys Gly Ala Pro Pro Cys Arg Thr Arg Ala Asp Phe Asn Ala Ser Thr 
570 575 580 

gac ttg ttg tgc cct acg gat tgt ttt agg aag cat cct gat gee act 2131 

Asp Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys His Pro Asp Ala Thr 
585 590 595 

tat att aag tgt ggt tct ggg ccc tgg etc aca cca aag tgc ctg gtc 2179 

Tyr lie Lys Cys Gly Ser Gly Pro Trp Leu Thr Pro Lys Cys Leu Val 
600 605 610 

cac tac cct tac aga etc tgg cat tac ccc tgc aca gtc aat ttt acc 222 7 

His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr 
615 620 625 

ate ttc aag ata aga atg tat gta ggg ggg gtt gag cac agg etc acg 2275 

lie Phe Lys He Arg Met Tyr Val Gly Gly Val Glu His Arg Leu Thr 



75 



JP Application No. 2003-329115 



gcc gca tgc aac ttc act cgt ggg gat cgc tgc gac ttg gag gac agg 2323 

Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg Cys Aap Leu Glu Asp Arg 
650 655 660 

gac agg agt cag ctg tct cct ctg ttg cac tct acc acg gaa tgg gcc 2371 

Asp Arg Ser Gin Leu Ser Pro Leu Leu His Ser Thr Thr Glu Trp Ala 
665 670 675 

ate ctg ccc tgc acc tac tea gac tta ccc get ttg tea act ggt ctt 2419 

lie Leu Pro Cys Thr Tyr Ser Asp Leu Pro Ala Leu Ser Thr Gly Leu 
680 685 690 

etc cac ctt cac cag aac ate gtg gac gta caa tac atg tat ggc etc 2467 

Leu His Leu His Gin Asn lie Val Asp Val Gin Tyr Met Tyr Gly Leu 
695 700 705 

tea cct get ate aca aaa tac gtc gtt cga tgg gag tgg gtg gta etc 2515 

Ser Pro Ala lie Thr Lys Tyr Val Val Arg Trp Glu Trp Val Val Leu 
710 715 720 725 

tta ttc ctg etc tta gcg gac gcc aga gtc tgc gcc tgc ttg tgg atg 2563 

Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys Ala Cys Leu Trp Met 
730 735 740 

etc ate ttg ttg ggc cag gcc gaa gca gca ttg gag aag ttg gtc gtc 2611 

Leu lie Leu Leu Gly Gin Ala Glu Ala Ala Leu Glu Lys Leu Val Val 
745 750 755 

ttg cac get gcg agt gcg get aac tgc cat ggc etc eta tat ttt gcc 265 9 
Leu His Ala Ala Ser Ala Ala Asn Cys His Gly Leu Leu Tyr Phe Ala 
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ate ttc ttc gtg gca get tgg cac ate agg ggt egg gtg gtc ccc ttg 2707 

lie Phe Phe Val Ala Ala Trp His lie Arg Gly Arg Val Val Pro Leu 
775 780 785 

ace acc tat tgc etc act ggc eta tgg ccc ttc tgc eta ctg etc atg 2755 

Thr Thr Tyr Cys Leu Thr Gly Leu Trp Pro Phe Cya Leu Leu Leu Met 

790 795 800 805 

gca ctg ccc egg cag get tat gee tat gac gca cct gtg cac gga cag 2803 

Ala Leu Pro Arg Gin Ala Tyr Ala Tyr Asp Ala Pro Val His Gly Gin 
810 815 820 

ata ggc gtg ggt ttg ttg ata ttg ate ace etc ttc aca etc acc ccg 2851 

He Gly Val Gly Leu Leu He Leu He Thr Leu Phe Thr Leu Thr Pro 
825 830 835 

ggg tat aag acc etc etc ggc cag tgt ctg tgg tgg ttg tgc tat etc 2899 

Gly Tyr Lya Thr Leu Leu Gly Gin Cys Leu Trp Trp Leu Cys Tyr Leu 
840 845 850 

ctg acc ctg ggg gaa gec atg att cag gag tgg gta cca ccc atg cag 2947 

Leu Thr Leu Gly Glu Ala Met He Gin Glu Trp Val Pro Pro Met Gin 
855 860 865 

gtg cgc ggc ggc cgc gat ggc ate gcg tgg gec gtc act ata ttc tgc 2995 

Val Arg Gly Gly Arg Asp Gly He Ala Trp Ala Val Thr He Phe Cys 

870 875 880 885 

ccg ggt gtg gtg ttt gac att acc aaa tgg ctt ttg gcg ttg ctt ggg 3043 

Pro Gly Val Val Phe Asp He Thr Lys Trp Leu Leu Ala Leu Leu Gly 
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890 895 

cct get tac etc tta agg gec get ttg aca 

Pro Ala Tyr Leu Leu Arg Ala Ala Leu Thr 

905 910 

aga get cac get ctg ata agg gta tgc get 

Arg Ala His Ala Leu lie Arg Val Cys Ala 
920 925 
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900 

cat gtg cog tac ttc gtc 3091 
His Val Pro Tyr Phe Val 
915 

ttg gtg aag cag etc gcg 3139 
Leu Val Lys Gin Leu Ala 
93 0 



ggg ggt agg tat gtt cag gtg gcg eta ttg gee ctt ggc agg tgg act 3187 
Gly Gly Arg Tyr Val Gin Val Ala Leu Leu Ala Leu Gly Arg Trp Thr 
935 940 945 



ggc acc tac ate tat gac cac etc aca cct atg teg gac tgg gee get 3235 
Gly Thr Tyr He Tyr Asp His Leu Thr Pro Met Ser Asp Trp Ala Ala 
950 955 960 965 



age ggc ctg cgc gac tta gcg gtc gee gtg gaa ccc ate ate ttc agt 3283 
Ser Gly Leu Arg Asp Leu Ala Val Ala Val Glu Pro He He Phe Ser 
970 975 980 



ccg atg gag aag aag gtc ate gtc tgg gga gcg gag acg get gca tgt 333 1 
Pro Met Glu Lys Lys Val He Val Trp Gly Ala Glu Thr Ala Ala Cys 
985 990 995 



ggg gac att eta cat gga ctt ccc gtg tec gec cga etc ggc cag gag 
Gly Asp He Leu His Gly Leu Pro Val Ser Ala Arg Leu Gly Gin Glu 
1000 1005 1010 



ate etc etc ggc cca get gat ggc tac acc tec aag ggg tgg aag etc 3427 
He Leu Leu Gly Pro Ala Asp Gly Tyr Thr Ser Lys Gly Trp Lys Leu 
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ctt get ccc ate act get tat gec cag caa aca cga ggc etc ctg ggc 3475 
Leu Ala Pro lie Thr Ala Tyr Ala Gin Gin Thr Arg Gly Leu Leu Gly 
1030 1035 1040 1045 

gec ata gtg gtg agt atg acg ggg cgt gac agg aca gaa cag gec ggg 3523 
Ala He val Val Ser Met Thr Gly Arg Asp Arg Thr Glu Gin Ala Gly 
1050 1055 1060 

gaa gtc caa ate ctg tec aca gtc tct cag tec ttc etc gga aca acc 3571 
Glu Val Gin He Leu Ser Thr Val Ser Gin Ser Phe Leu Gly Thr Thr 
1065 1070 1075 

ate teg ggg gtt ttg tgg act gtt tac cac gga get ggc aac aag act 3 619 
He Ser Gly Val Leu Trp Thr Val Tyr His Gly Ala Gly Asn Lys Thr 
1080 1085 1090 

eta gec ggc tta egg ggt ccg gtc acg cag atg tac teg agt get gag 3667 
Leu Ala Gly Leu Arg Gly Pro Val Thr Gin Met Tyr Ser Ser Ala Glu 
1095 1100 1105 

ggg gac ttg gta ggc tgg ccc age ccc cct ggg acc aag tct ttg gag 3715 
Gly Asp Leu Val Gly Trp Pro Ser Pro Pro Gly Thr Lys Ser Leu Glu 
1110 1115 1120 1125 

ccg tgc aag tgt gga gec gtc gac eta tat ctg gtc acg egg aac get 3763 
Pro Cys Lys Cys Gly Ala Val Asp Leu Tyr Leu Val Thr Arg Asn Ala 
1130 1135 1140 

gat gtc ate ccg get egg aga cgc ggg gac aag egg gga gca ttg etc 3 811 
Asp Val He Pro Ala Arg Arg Arg Gly Asp Lys Arg Gly Ala Leu Leu 
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tec ccg aga ccc att teg acc ttg aag ggg tec teg ggg ggg ccg gtg 3 859 
Ser Pro Arg Pro lie Ser Thr Leu Lys Gly Ser Ser Gly Gly Pro Val 
1160 1165 1170 

etc tgc cct agg ggc cac gtc gtt ggg etc ttc cga gca get gtg tgc 3907 
Leu Cys Pro Arg Gly His Val Val Gly Leu Phe Arg Ala Ala Val Cys 
1175 1180 1185 

tct egg ggc gtg gec aaa tec ate gat ttc ate ccc gtt gag aca etc 3955 
Ser Arg Gly Val Ala Lyg Ser He Asp Phe lie Pro Val Glu Thr Leu 
1190 1195 1200 1205 

gac gtt gtt aca agg tct ccc act ttc agt gac aac age acg cca ccg 4003 
Asp Val Val Thr Arg Ser Pro Thr Phe Ser Asp Asn Ser Thr Pro Pro 
1210 1215 1220 



get gtg ccc cag acc tat cag gtc ggg 
Ala Val Pro Gin Thr Tyr Gin Val Gly 
1225 1230 

agt'gga aag age acc aag gtc cct gtc 
Ser Gly Lys Ser Thr Lys Val Pro Val 
1240 1245 

aaa gta eta gtg ctt aac ccc teg gta 
Lys Val Leu Val Leu Asn Pro Ser Val 
1255 1260 

gcg tac eta tec aag gca cat ggc ate 
Ala Tyr Leu Ser Lys Ala His Gly He 



tac ttg cat get cca act ggc 4051 

Tyr Leu His Ala Pro Thr Gly 

1235 

gcg tat gec gee cag ggg tac 4 09 9 

Ala Tyr Ala Ala Gin Gly Tyr 
1250 

get gee acc ctg ggg ttt ggg 4147 

Ala Ala Thr Leu Gly Phe Gly 
1265 

aat ccc aac att agg act gga 4195 

Asn Pro Asn He Arg Thr Gly 
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gtc agg acc gtg atg acc ggg gag gcc ate acg tac tec aca tat ggc 4243 
Val Arg Thr Val Met Thr Gly Glu Ala lie Thr Tyr Ser Thr Tyr Gly 
1290 1295 1300 

aaa ttt etc gcc gat ggg ggc tgc get age ggc gcc tat gac ate ate 4291 
Lys Phe Leu Ala Asp Gly Gly Cys Ala Ser Gly Ala Tyr Asp He He 
1305 1310 1315 

ata tgc gat gaa tgc cac get gtg gat get acc tec att etc ggc ate 4339 
He Cys Asp Glu Cys His Ala Val Asp Ala Thr Ser He Leu Gly He 
1320 1325 1330 

gga acg gtc ctt gat caa gca gag aca gcc ggg gtc aga eta act gtg 4387 
Gly Thr Val Leu Asp Gin Ala Glu Thr Ala Gly Val Arg Leu Thr Val 
1335 1340 1345 

ctg get acg gcc aca ccc ccc ggg tea gtg aca acc ccc cat ccc gat 4435 
Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr Thr Pro His Pro Asp 
1350 1355 1360 1365 

ata gaa gag gta ggc etc ggg egg gag ggt gag ate ccc ttc tat ggg 4483 
He Glu Glu Val Gly Leu Gly Arg Glu Gly Glu He Pro Phe Tyr Gly 
1370 1375 1380 

agg gcg att ccc eta tec tgc ate aag gga ggg aga cac ctg att ttc 4531 
Arg Ala He Pro Leu Ser Cys He Lys Gly Gly Arg His Leu He Phe 
1385 1390 1395 

tgc cac tea aag aaa aag tgt gac gag etc gcg gcg gcc ctt egg ggc 457 9 
Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala Ala Ala Leu Arg Gly 
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atg ggc ttg aat gcc gtg gca tac tat aga ggg ttg gac gtc tec ata 4627 

Met Gly Leu flsn Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ser He 
1415 1420 1425 

ata cca get cag gga gat gtg gtg gtc gtc gcc acc gac gcc etc atg 4675 

He Pro Ala Gin Gly Asp Val Val Val Val Ala Thr Asp Ala Leu Met 
1430 1435 1440 1445 

acg ggg tac act gga gac ttt gac tec gtg ate gac tgc aat gta gcg 4723 

Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val lie Asp Cys Asn Val Ala 
1450 1455 1460 

gtc acc caa get gtc gac ttc age ctg gac ccc acc ttc act ata acc 4771 

Val Thr Gin Ala Val Asp Phe Ser Leu Asp Pro Thr Phe Thr He Thr 

1465 1470 1475 

aca cag act gtc cca caa gac get gtc tea cgc agt cag cgc cgc ggg 4819 

Thr Gin Thr Val Pro Gin Asp Ala Val Ser Arg Ser Gin Arg Arg Gly 
1480 1485 1490 

cgc aca ggt aga gga aga cag ggc act tat agg tat gtt tec act ggt 4867 

Arg Thr Gly Arg Gly Arg Gin Gly Thr Tyr Arg Tyr Val Ser Thr Gly 
1495 1500 1505 

gaa cga gcc tea gga atg ttt gac agt gta gtg ctt tgt gag tgc tac 4915 

Glu Arg Ala Ser Gly Met Phe Asp Ser Val Val Leu Cys Glu Cys Tyr 
1510 1515 1520 1525 

gac gca ggg get gcg tgg tac gat etc aca cca gcg gag acc acc gtc 4963 

Asp Ala Gly Ala Ala Trp Tyr Asp Leu Thr Pro Ala Glu Thr Thr Val 
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agg ctt aga gcg tat ttc aac acg ccc ggc eta ccc gtg tgt caa gac 5011 
Arg Leu Arg Ala Tyr Phe Asn Thr Pro Gly Leu Pro Val Cys Gin Asp 
1545 1550 1555 

cat ctt gaa ttt tgg gag gca gtt ttc acc ggc etc aca cac ata gac 505 9 
His Leu Glu Phe Trp Glu Ala Val Phe Thr Gly Leu Thr His He Asp 
1560 1565 1570 

gec cac ttc etc tec caa aca aag caa gcg ggg gag aac ttc gcg tac 5107 
Ala His Phe Leu Ser Gin Thr Lys Gin Ala Gly Glu Asn Phe Ala Tyr 
1575 1580 1585 

ota gta gec tac caa get acg gtg tgc gec aga gec aag gee cct ccc 51SS 
Leu Val Ala Tyr Gin Ala Thr val Cys Ala Arg Ala Lys Ala Pro Pro 
1590 1595 1600 1605 

ccg tec tgg gac gee atg tgg aag tgc ctg gec cga etc aag cct acg 5203 
Pro Ser Trp Aap Ala Met Trp Lys Cys Leu Ala Arg Leu Lys Pro Thr 
1610 1615 1620 

ctt gcg ggc ccc aca cct etc ctg tac cgt ttg ggc cct att acc aat 5251 
Leu Ala Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Pro He Thr Asn 
1625 1630 1635 

gag gtc acc etc aca cac cct ggg acg aag tac ate gec aca tgc atg 5299 
Glu Val Thr Leu Thr His Pro Gly Thr Lys Tyr He Ala Thr Cys Met 
1640 1645 1650 

caa get gac ctt gag gtc atg acc age acg tgg gtc eta get gga gga 534 7 
Gin Ala Asp Leu Glu Val Met Thr Ser Thr Trp Val Leu Ala Gly Gly 
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gtc ctg gca gcc gtc gcc gca tat tgc ctg gcg act gga tgc gtt tec 5395 

Val Leu Ala Ala Val Ala Ala Tyr Cys Leu Ala Thr Gly Cys Val Ser 

1670 1675 1680 1685 

ate ate ggc cgc ttg cac gtc aac cag cga gtc gtc gtt gcg ccg gat 5443 

He He Gly Arg Leu His Val Asn Gin Arg Val Val Val Ala Pro Asp 
1690 169S 1700 

aag gag gtc ctg tat gag get ttt gat gag atg gag gaa tgc gcc tct 5491 

Lys Glu Val Leu Tyr Glu Ala Phe Asp Glu Met Glu Glu Cys Ala Ser 
1705 1710 1715 

agg gcg get etc ate gaa gag ggg cag egg ata gcc gag atg ttg aag 553 9 

Arg Ala Ala Leu He Glu Glu Gly Gin Arg He Ala Glu Met Leu Lys 
1720 1725 1730 

tec aag ate caa ggc ttg ctg cag cag gcc tct aag cag gcc cag gac 5587 

Ser Lys He Gin Gly Leu Leu Gin Gin Ala Ser Lys Gin Ala Gin Asp 
1735 1740 1745 

ata caa ccc get atg cag get tea tgg ccc aaa gtg gaa caa ttt tgg 563 5 

lie Gin Pro Ala Met Gin Ala Ser Trp Pro Lys Val Glu Gin Phe Trp 

1750 1755 1760 1765 

gcc aga cac atg tgg aac ttc att age ggc ate caa tac etc gca gga 5683 

Ala Arg His Met Trp Asn Phe He Ser Gly He Gin Tyr Leu Ala Gly 
1770 1775 1780 

ttg tea aca ctg cca ggg aac ccc gcg gtg get tec atg atg gca ttc 5731 

Leu Ser Thr Leu Pro Gly Asn Pro Ala Val Ala Ser Met Met Ala Phe 
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agt gcc gcc etc acc agt ccg ttg teg acc agt acc acc ate ctt etc 5779 

Ser Ala Ala Leu Thr Ser Pro Leu Ser Thr Ser Thr Thr He Leu Leu 
1800 1805 1810 

aac ate atg gga ggc tgg tta gcg tec cag ate gca cca ccc gcg ggg S827 

Asn He Met Gly Gly Trp Leu Ala Ser Gin He Ala Pro Pro Ala Gly 
1815 1820 1825 

gcc acc ggc ttt gtc gtc agt ggc ctg gtg ggg get gcc gtg ggc age 587 5 

Ala Thr Gly Phe Val Val Ser Gly Leu Val Gly Ala Ala Val Gly Ser 

1830 1835 1840 1845 

ata ggc ctg ggt aag gtg ctg gtg gac ate ctg gca gga tat ggt gcg 592 3 

He Gly Leu Gly Lye Val Leu Val Asp He Leu Ala Gly Tyr Gly Ala 
1850 18 5 5 1860 

ggc att teg ggg gcc etc gtc gca ttc aag ate atg tct ggc gag aag 5971 

Gly He Ser Gly Ala Leu Val Ala Phe Lys He Met Ser Gly Glu Lys 
1865 1870 1875 

ccc tct atg gaa gat gtc ate aat eta ctg cct ggg ate ctg tct ccg 6019 

Pro Ser Met Glu Asp Val He Asn Leu Leu Pro Gly He Leu Ser Pro 
1880 1885 1890 

gga gcc ctg gtg gtg ggg gtc ate tgc gcg gcc att ctg cgc cgc cac 6067 

Gly Ala Leu Val Val Gly Val He Cys Ala Ala He Leu Arg Arg His 
1895 1900 1905 

gtg gga ccg ggg gag ggc gcg gtc caa tgg atg aac agg ctt att gcc 6115 

Val Gly Pro Gly Glu Gly Ala Val Gin Trp Met Asn Arg Leu He Ala 
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1910 1915 1920 1925 

ttt got tec aga gga aac cac gtc gec cct act cac tac gtg acg gag 6153 

Phe Ala Ser Arg Gly Asn His Val Ala Pro Thr His Tyr Val Thr Glu 

1930 193S 1940 

teg gat gcg teg cag cgt gtg acc caa eta ctt ggc tct ctt act ata 5211 

Ser Asp Ala Ser Gin Arg Val Thr Gin Leu Leu Gly Ser Leu Thr He 

1945 19S0 1955 

acc age eta etc aga aga etc cac aat tgg ata act gag gac tgc ccc 6259 

Thr Ser Leu Leu Arg Arg Leu His Asn Trp He Thr Glu Asp Cys Pro 

1360 1965 1970 

ate cca tgc tec gga tec tgg etc cgc gac gtg tgg gac tgg gtt tgc 6307 

He Pro Cys Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp Val Cys 

1975 1980 1985 

acc ate ttg aca gac ttc aaa aat tgg ctg acc tct aaa ttg ttc ccc 6355 

Thr He Leu Thr Asp Phe Lys Asn Trp Leu Thr Ser Lys Leu Phe Pro 

1990 1995 2000 2005 

aag ctg ccc ggc etc ccc ttc ate tct tgt caa aag ggg tac aag ggt 6403 

Lys Leu Pro Gly Leu Pro Phe He Ser Cys Gin Lys Gly Tyr Lys Gly 

2010 2015 2020 

gtg tgg gec ggc act ggc ate atg acc acg cgc tgc cct tgc ggc gee 6451 

Val Trp Ala Gly Thr Gly He Met Thr Thr Arg Cys Pro Cys Gly Ala 

2025 2030 2035 

aac ate tct ggc aat gtc cgc ctg ggc tct atg agg ate aca ggg cct 649 9 

Asn He Ser Gly Asn Val Arg Leu Gly Ser Met Arg lie Thr Gly Pro 
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aaa acc tgc atg aac acc tgg cag ggg acc ttt cct ate aat tgc tac 5547 
Lys Thr Cys Met Asn Thr Trp Gin Gly Thr Phe Pro He Asn Cys Tyr 
2055 2060 2065 

acg gag ggc cag tgc gcg ccg aaa ccc ccc acg aac tac aag acc gec 6595 
Thr Glu Gly Gin Cys Ala Pro Lys Pro Pro Thr Asn Tyr Lys Thr Ala 
2070 2075 2080 2085 

ate tgg agg gtg gcg gec teg gag tac gcg gag gtg acg cag cat ggg 6643 
He Trp Arg Val Ala Ala Ser Glu Tyr Ala Glu Val Thr Gin His Gly 
2090 2095 2100 

teg tac tec tat gta aca gga ctg acc act gac aat ctg aaa att cct 6691 
Ser Tyr Ser Tyr Val Thr Gly Leu Thr Thr Asp Asn Leu Lys He Pro 
2105 2110 2115 

tgc caa eta cct tct cca gag ttt ttc tec tgg gtg gac ggt gtg cag 6739 
Cys Gin Leu Pro Ser Pro Glu Phe Phe Ser Trp Val Asp Gly Val Gin 
2120 2125 2130 

ate cat agg ttt gca ccc aca cca aag ccg ttt ttc egg gat gag gtc 6787 
He His Arg Phe Ala Pro Thr Pro Lys Pro Phe Phe Arg Asp Glu Val 
2135 2140 2145 

teg ttc tgc gtt ggg ctt aat tec tat get gtc ggg tec cag ctt ccc 6835 
Ser Phe Cys Val Gly Leu Asn Ser Tyr Ala Val Gly Ser Gin Leu Pro 
2150 215S 2160 2165 

tgt gaa cct gag ccc gac gca gac gta ttg agg tec atg eta aca gat 6883 
Cys Glu Pro Glu Pro Asp Ala Asp Val Leu Arg Ser Met Leu Thr Asp 
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cog ccc cac ate acg gcg gag act gcg gcg egg cgc ttg gca egg gga 693 1 

Pro Pro Bis lie Thr Ala Glu Thr Ala Ala Arg Arg Leu Ala Arg Gly 
2185 2190 2195 

tea cct cca tct gag gcg age tec tea gtg age cag eta tea gca ccg 6979 

Ser Pro Pro Ser Glu Ala Ser Ser Ser val Ser Gin Leu Ser Ala Pro 
2200 2205 2210 

teg ctg egg gee acc tgc ace ace cac age aac acc tat gac gtg gac 7027 

Ser Leu Arg Ala Thr Cys Thr Thr His Ser Asn Thr Tyr Asp Val Asp 
2215 2220 2225 

atg gtc gat gec aac ctg etc atg gag ggc ggt gtg get cag aca gag 707 5 

Met Val Asp Ala Asn Leu Leu Met Glu Gly Gly Val Ala Gin Thr Glu 
2230 2235 2240 2245 

cct gag tec agg gtg ccc gtt ctg gac ttt etc gag cca atg gec gag 7123 

Pro Glu Ser Arg Val Pro Val Leu Asp Phe Leu Glu Pro Met Ala Glu 
2250 2255 2250 

gaa gag age gac ctt gag ccc tea ata cca teg gag tgc atg etc ccc 717 1 

Glu Glu Ser Asp Leu Glu Pro Ser lie Pro Ser Glu Cys Met Leu Pro 
2265 2270 2275 

agg age ggg ttt cca egg gec tta ccg get tgg gca egg cct gac tac 7219 

Arg Ser Gly Phe Pro Arg Ala Leu Pro Ala Trp Ala Arg Pro Asp Tyr 
2280 2285 2290 

aac ccg ccg etc gtg gaa teg tgg agg agg cca gat tac caa ccg ccc 7267 

Asn Pro Pro Leu Val Glu Ser Trp Arg Arg Pro Asp Tyr Gin Pro Pro 
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acc gtt get ggt tgt get etc ccc ccc ccc aag aag goc ccg acg cct 7315 

Thr val Ala Gly Cys Ala Leu Pro Pro Pro Lys Lye Ala Pro Thr Pro 

2310 2315 2320 2325 

ccc cca agg aga cgc egg aca gtg ggt ctg age gag age acc ata tea 7363 

Pro Pro Arg Arg Arg Arg Thr Val Gly Leu Ser Glu Ser Thr He Ser 
2330 2335 2340 

gaa gec etc cag caa ctg gec ate aag acc ttt ggc cag ccc ccc teg 7411 

Glu Ala Leu Gin Gin Leu Ala He Lys Thr Phe Gly Gin Pro Pro Ser 
2345 2350 2355 

age ggt gat gca ggc teg tec acg ggg gcg ggc gec gec gaa tec ggc 745 9 

Ser Gly Asp Ala Gly Ser Ser Thr Gly Ala Gly Ala Ala Glu Ser Gly 
2360 2365 2370 

ggt ccg acg tec cct ggt gag ccg gee ccc tea gag aca ggt tec gec 7507 

Gly Pro Thr Ser Pro Gly Glu Pro Ala Pro Ser Glu Thr Gly Ser Ala 
2375 2380 2385 

tec tct atg ccc ccc etc gag ggg gag cct gga gat ccg gac ctg gag 755 5 

Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Glu 

2390 2395 2400 2405 

tct gat cag gta gag ctt caa cct ccc ccc cag ggg ggg ggg gta get 7603 

Ser Aap Gin val Glu Leu Gin Pro Pro Pro Gin Gly Gly Gly Val Ala 
2410 2415 2420 

ccc ggt teg ggc teg ggg tct tgg tct act tgc tec gag gag gac gat 7651 

Pro Gly Ser Gly Ser Gly Ser Trp Ser Thr cys Ser Glu Glu Asp Asp 
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acc acc gtg tgc tgc tec atg tea tac tec tgg acc ggg get eta ata 7699 
Thr Thr Val Cys Cys Ser Met Ser Tyr Ser Trp Thr Gly Ala Leu He 
2440 2445 2450 

act ccc tgt age ccc gaa gag gaa aag ttg cca ate aac cct ttg agt 7747 
Thr Pro Cys Ser Pro Glu Glu Glu Lys Leu Pro He Asn Pro Leu Ser 
2455 2460 2465 

aac teg ctg ttg cga tac cat aac aag gtg tac tgt aca aca tea aag 7795 
Asn Ser Leu Leu Arg Tyr Hia Asn Lys Val Tyr Cys Thr Thr Ser Lys 
2470 2475 2480 2485 

age gec tea cag agg get aaa aag gta act ttt gac agg acg caa gtg 7843 
Ser Ala Ser Gin Arg Ala Lys Lys Val Thr Phe Asp Arg Thr Gin Val 
2490 2495 2500 

etc gac gec cat tat gac tea gtc tta aag gac ate aag eta gcg get 7891 
Leu Asp Ala His Tyr Asp Ser Val Leu Lys Asp lie Lys Leu Ala Ala 

2505 2510 2515 

tec aag gtc age gca agg etc etc acc ttg gag gag gcg tgc cag ttg 793 9 
Ser Lys Val Ser Ala Arg Leu Leu Thr Leu Glu Glu Ala Cys Gin Leu 
2520 2525 2530 

act cca ccc cat tct gca aga tec aag tat gga ttc ggg gec aag gag 7987 
Thr Pro Pro His Ser Ala Arg Ser Lys Tyr Gly Phe Gly Ala Lys Glu 
2S35 2540 2545 

gtc cgc age ttg tec ggg agg gec gtt aac cac ate aag tec gtg tgg 803 5 
Val Arg Ser Leu Ser Gly Arg Ala Val Asn His He Lys Ser Val Trp 
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aag gac etc ctg gaa gac cca caa aca cca att ccc aca acc ate atg 8083 
Lys Asp Leu Leu Glu Asp Pro Gin Thr Pro lie Pro Thr Thr lie Met 
2570 2575 2580 



gec aaa aat gag gtg ttc tgc gtg gac ccc gec aag ggg ggt aag aaa 8131 
Ala Lys Asn Glu Val Phe Cy3 Val Asp Pro Ala Lys Gly Gly Lys Lys 
2585 2590 2595 



cca get cgc etc ate gtt tac cct gac etc ggc gtc egg gtc tgc gag 817 9 
Pro Ala Arg Leu lie Val Tyr Pro Asp Leu Gly Val Arg Val Cys Glu 
2600 260S 2610 



aaa atg gec etc tat gac att aca caa aag ctt cct cag gcg gta atg 822 7 
Lys Met Ala Leu Tyr Asp lie Thr Gin Lys Leu Pro Gin Ala Val Met 
2615 2620 2625 



gga get tec tat ggc ttc cag tac tec cct gec caa egg gtg gag tat 8275 
Gly Ala Ser Tyr Gly Phe Gin Tyr Ser Pro Ala Gin Arg Val Glu Tyr 
2630 2635 2640 2645 



etc ttg aaa gca tgg gcg gaa aag aag gac ccc atg ggt ttt teg tat 8323 
Leu Leu Lys Ala Trp Ala Glu Lys Lys Asp Pro Met Gly Phe Ser Tyr 
2650 2655 2660 



gat acc cga tgc ttc gac tea acc gtc act gag aga gac ate agg acc 8371 
Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Arg Asp He Arg Thr 
2665 2670 2675 



gag gag tec ata tac cag gec tgc tec ctg ccc gag gag gec cgc act 8419 
Glu Glu Ser He Tyr Gin Ala Cys Ser Leu Pro Glu Glu Ala Arg Thr 
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gcc ata cac teg ctg act gag aga ctt tac gta gga ggg ccc atg ttc 8467 

Ala lie His Ser Leu Thr Glu Arg Leu Tyr Val Gly Gly Pro Met Phe 
2695 2700 2705 

aac age aag ggt caa acc tgc ggt tac aga cgt tgc cgc gcc age ggg 8515 

Asn Ser Lys Gly Gin Thr Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly 

2710 2715 2720 2725 

gtg eta acc act age atg ggt aac acc ate aca tgc tat gtg aaa gcc 8563 

Val Leu Thr Thr Ser Met Gly Asn Thr lie Thr Cys Tyr Val Lys Ala 

2730 2735 2740 

eta gcg gcc tgc aag get gcg ggg ata gtt gcg ccc aca atg ctg gta 8611 

Leu Ala Ala Cys Lys Ala Ala Gly He Val Ala Pro Thr Met Leu Val 

2745 27S0 2755 

tgc ggc gat gac eta gta gtc ate tea gaa age cag ggg act gag gag 8659 

Cys Gly Asp Asp Leu Val Val He Ser Glu Ser Gin Gly Thr Glu Glu 
2760 2765 2770 

gac gag egg aac ctg aga gcc ttc acg gag gcc atg acc agg tac tct 8707 

Asp Glu Arg Asn Leu Arg Ala Phe Thr Glu Ala Met Thr Arg Tyr Ser 
2775 2780 2785 

gcc cct cct ggt gat ccc ccc aga ccg gaa tat gac ctg gag eta ata 8755 

Ala Pro Pro Gly Asp Pro Pro Arg Pro Glu Tyr Asp Leu Glu Leu He 

2790 2795 2800 2805 

aca tec tgt tec tea aat gtg tct gtg gcg ttg ggc ccg egg ggc cgc 8803 

Thr Ser Cys Ser Ser Asn Val Ser Val Ala Leu Gly Pro Arg Gly Arg 



92 



JP Application No. 2003-329115 



cgc aga tac tac ctg acc aga gac cca acc act cca etc gec egg get 8851 
Arg Arg Tyr Tyr Leu Thr Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala 
2825 2830 2835 

gec tgg gaa aca gtt aga cac tec cct ate aat tea tgg ctg gga aac 8899 
Ala Trp Glu Thr Val Arg His Ser Pro lie Asn Ser Trp Leu Gly Asn 
2840 2845 2850 

ate ate cag tat get cca acc ata tgg gtt cgc atg gtc eta atg aca 8947 
lie lie Gin Tyr Ala Pro Thr He Trp Val Arg Met Val Leu Met Thr 
2855 2860 2865 

cac ttc ttc tec att etc atg gtc caa gac acc ctg gac cag aac etc 8995 
His Phe Phe Ser He Leu Met Val Gin Asp Thr Leu Asp Gin Asn Leu 
2870 2875 2880 2885 

aac ttt gag atg tat gga tea gta tac tec gtg aat cct ttg gac ctt 9043 
Asn Phe Glu Met Tyr Gly Ser Val Tyr Ser Val Asn Pro Leu Asp Leu 
2890 2895 2900 

coa gec ata att gag agg tta cac ggg ctt gac gec ttt tct atg cac 9091 
Pro Ala He He Glu Arg Leu His Gly Leu Asp Ala Phe Ser Met His 
2905 2910 2915 

aca tac tct cac cac gaa ctg acg egg gtg get tea gec etc aga aaa 913 9 
Thr Tyr Ser His His Glu Leu Thr Arg Val Ala Ser Ala Leu Arg Lys 
2920 2925 2930 

ctt ggg gcg cca ccc etc agg gtg tgg aag agt egg get cgc gca gtc 9187 
Leu Gly Ala Pro Pro Leu Arg Val Trp Lys Ser Arg Ala Arg Ala Val 
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agg gcg tec etc ate tec cgt gga ggg aaa gcg gec gtt tgc ggc cga 923 5 
Arg Ala Ser Leu lie Ser Arg Gly Gly Lys Ala Ala Val Cysj Gly Arg 
2950 2955 2960 2965 

tat etc ttc aat tgg gcg gtg aag acc aag etc aaa etc act cca ttg 9283 
Tyr Leu Phe Asn Trp Ala Val Lys Thr Lys Leu Lys Leu Thr Pro Leu 
2970 297S 2980 

ccg gag gcg cgc eta ctg gac tta tec agt tgg ttc acc gtc ggc gec 9331 
Pro Glu Ala Arg Leu Leu Asp Leu Ser Ser Trp Phe Thr Val Gly Ala 
2985 2990 2995 

ggc ggg ggc gac att ttt cac ago gtg teg cgc gec cga ccc cgc tea 937 9 
Gly Gly Gly Asp lie Phe His Ser Val Ser Arg Ala Arg Pro Arg Ser 
3000 3005 3010 

tta etc ttc ggc eta etc eta ctt ttc gta ggg gta ggc etc ttc eta 9427 
Leu Leu Phe Gly Leu Leu Leu Leu Phe Val Gly Val Gly Leu Phe Leu 
3015 3020 3025 

etc ccc get egg tag agcggcacac actaggtaca ctccatagct aactgttcct 9482 
Leu Pro Ala Arg 
3 03 0 

tttttttttt tttttttttt tttttttttt tttttttttt ttcttttttt tttttttCCC 9542 
tctttcttcc cttctcatct tattctactt tctttcttgg tggctccatc ttagecctag 9602 
teaeggctag ctgtgaaagg tccgtgagcc geatgactge agagagtgee gtaactggtc 9662 
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tctctgcaga tcatgt 9578 

<210> 4 
<211> 3033 
<212> PRT 

<213> Hepatitis C virus 
<400> 4 

Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 

15 10 15 

Arg Arg Pro Glu Asp Val Lys Phe Pro Gly Gly Gly Gin lie Val Gly 

20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Thr 

35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 

50 55 60 

lie Pro Lys Asp Arg Arg Ser Thr Gly Lys Ala Trp Gly Lys Pro Gly 

65 70 75 80 

Arg Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp 

85 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Px-o Thr Asp Pro 

100 105 110 

Arg His Arg Ser Arg Asn Val Gly Lys Val lie Asp Thr Leu Thr Cys 

115 120 125 

Gly Phe Ala Asp Leu Met Gly Tyr lie Pro Val Val Gly Ala Pro Leu 

130 135 140 

Ser Gly Ala Ala Arg Ala Val Ala His Gly Val Arg Val Leu Glu Asp 

145 150 155 160 

Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Phe Pro Phe Ser lie 

165 170 175 

Phe Leu Leu Ala Leu Leu Ser Cys lie Thr val Pro Val Ser Ala Ala 
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180 1B5 190 

Gin Val Lys Asn Thr Ser Ser Ser Tyr Met Val Thr Asn Asp Cys Ser 

195 200 205 

Asn Asp Ser lie Thr Trp Gin Leu Glu Ala Ala Val Leu His Val Pro 

210 215 220 

Gly Cys Val Pro Cys Glu Arg Val Gly Aan Thr Ser Arg Cys Trp Val 
225 230 235 240 

Pro Val Ser Pro Asn Met Ala Val Arg Gin Pro Gly Ala Leu Thr Gin 

245 250 255 

Gly Leu Arg Thr His He Asp Met Val Val Met Ser Ala Thr Phe Cys 

260 265 270 

Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Gly Val Met Leu Ala Ala 

275 280 285 

Gin Val Phe He Val Ser Pro Gin Tyr His Trp Phe Val Gin Glu Cys 

290 295 300 

Asn Cys Ser He Tyr Pro Gly Thr He Thr Gly His Arg Met Ala Trp 
305 310 315 320 

Asp Met Met Met Asn Trp Ser Pro Thr Ala Thr Met He Leu Ala Tyr 

325 330 335 

Val Met Arg Val Pro Glu Val He Xle Asp He Val Ser Gly Ala His 

340 345 350 

Trp Gly Val Met Phe Gly Leu Ala Tyr Phe Ser Met Gin Gly Ala Trp 

355 360 365 

Ala Lys Val He Val He Leu Leu Leu Ala Ala Gly Val Asp Ala Gly 

370 375 380 

Thr Thr Thr Val Gly Gly Ala Val Ala Arg Ser Thr Asn Val He Ala 
385 390 395 400 

Gly Val Phe Ser His Gly Pro Gin Gin Asn He Gin Leu He Asn Thr 

405 410 415 

Asn Gly Ser Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser 

420 425 430 

Leu Asn Thr Gly Phe Leu Ala Ala Leu Phe Tyr Thr Asn Arg Phe Asn 
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435 440 445 

Ser Ser Gly Cys Pro Gly Arg Leu Ser Ala Cys Arg Asa lie Glu Ala 

450 455 460 

Phe Arg lie Gly Trp Gly Thr Leu Gin Tyr Glu Aap Asn Val Thr Asn 
465 470 475 480 

Pro Glu Asp Met Arg Pro Tyr Cys Trp His Tyr Pro Pro Lys Pro Cys 

485 490 495 

Gly Val Val Pro Ala Arg Ser Val Cys Gly Pro Val Tyr Cys Phe Thr 

500 505 510 

Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Arg Gly Val Pro Thr 

515 520 525 

Tyr Thr Trp Gly Glu Asn Glu Thr Asp Val Phe Leu Leu Asn Ser Thr 
530 535 540 

Arg Pro Pro Gin Gly Ser Trp Phe Gly Cys Thr Trp Met Asn Ser Thr 

545 550 555 560 

Gly Phe Thr Lys Thr Cys Gly Ala Pro Pro Cys Arg Thr Arg Ala Asp 

565 570 575 

Phe Asn Ala Ser Thr Asp Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys 

580 585 590 

His Pro Asp Ala Thr Tyr lie Lys Cys Gly Ser Gly Pro Trp Leu Thr 

595 600 605 

Pro Lys Cys Leu Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys 

610 615 620 

Thr Val Asn Phe Thr He Phe Lys Xle Arg Met Tyr Val Gly Gly Val 
625 630 635 640 

Glu His Arg Leu Thr Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg Cys 

645 650 655 

Asp Leu Glu Asp Arg Asp Arg Ser Gin Leu Ser Pro Leu Leu His Ser 

660 665 670 

Thr Thr Glu Trp Ala He Leu Pro Cys Thr Tyr Ser Asp Leu Pro Ala 

675 680 685 

Leu Ser Thr Gly Leu Leu His Leu His Gin Asn He Val Asp Val Gin 
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690 695 700 

Tyr Met Tyr Gly Leu Ser Pro Ala lie Thr Lys Tyr Val Val Arg Trp 
705 710 715 720 

Glu Trp Val Val Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys 

725 730 735 

Ala Cys Leu Trp Met Leu lie Leu Leu Gly Gin Ala Glu Ala Ala Leu 

740 745 750 

Glu Lya Leu Val Val Leu His Ala Ala Ser Ala Ala Asn Cys His Gly 

755 760 765 

Leu Leu Tyr Phe Ala lie Phe Phe Val Ala Ala Trp His lie Arg Gly 

770 775 780 

Arg Val Val Pro Leu Thr Thr Tyr Cys Leu Thr Gly Leu Trp Pro Phe 
785 790 795 800 

Cys Leu Leu Leu Met Ala Leu Pro Arg Gin Ala Tyr Ala Tyr Asp Ala 

805 810 815 

Pro Val His Gly Gin He Gly Val Gly Leu Leu He Leu He Thr Leu 

820 825 830 

Phe Thr Leu Thr Pro Gly Tyr Lys Thr Leu Leu Gly Gin Cys Leu Trp 

835 840 845 

Trp Leu Cys Tyr Leu Leu Thr Leu Gly Glu Ala Met lie Gin Glu Trp 

850 855 860 

Val Pro Pro Met Gin Val Arg Gly Gly Arg Asp Gly He Ala Trp Ala 
865 870 875 880 

Val Thr He Phe Cys Pro Gly Val Val Phe Asp He Thr Lys Trp Leu 

885 890 • 895 

Leu Ala Leu Leu Gly Pro Ala Tyr Leu Leu Arg Ala Ala Leu Thr His 

900 905 910 

Val Pro Tyr Phe Val Arg Ala His Ala Leu He Arg Val Cys Ala Leu 

915 920 925 

Val Lys Gin Leu Ala Gly Gly Arg Tyr Val Gin Val Ala Leu Leu Ala 

930 935 940 

Leu Gly Arg Trp Thr Gly Thr Tyr lie Tyr Asp His Leu Thr Pro Met 
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945 950 955 960 

Ser Asp Trp Ala Ala Ser Gly Leu Arg Asp Leu Ala Val Ala Val Glu 

965 970 975 

Pro lie lie Phe Ser Pro Met Glu Lys Lys Val lie Val Trp Gly Ala 

980 985 990 

Glu Thr Ala Ala Cys Gly Asp lie Leu His Gly Leu Pro Val Ser Ala 

995 1000 1005 

Arg Leu Gly Gin Glu He Leu Leu Gly Pro Ala Asp Gly Tyr Thr Ser 

1010 1015 1020 

Lys Gly Trp Lys Leu Leu Ala Pro He Thr Ala Tyr Ala Gin Gin Thr 
1025 1030 1035 1040 

Arg Gly Leu Leu Gly Ala He Val Val Ser Met Thr Gly Arg Asp Arg 

1045 1050 10SS 

Thr Glu Gin Ala Gly Glu Val Gin He Leu Ser Thr Val Ser Gin Ser 

1060 1065 1070 

Phe Leu Gly Thr Thr He Ser Gly Val Leu Trp Thr Val Tyr His Gly 

1075 1080 1085 

Ala Gly Asn Lys Thr Leu Ala Gly Leu Arg Gly Pro Val Thr Gin Met 

1090 1095 HOO 

Tyr ser Ser Ala Glu Gly Asp Leu Val Gly Trp Pro Ser Pro Pro Gly 
1105 1110 1115 1120 

Thr Lys Ser Leu Glu Pro Cys Lys Cys Gly Ala Val Asp Leu Tyr Leu 

1125 1130 1135 

Val Thr Arg Asn Ala Asp Val He Pro Ala Arg Arg Arg Gly Asp Lys 

1140 1145 1150 

Arg Gly Ala Leu Leu Ser Pro Arg Pro He Ser Thr Leu Lys Gly Ser 

1155 1160 1165 

Ser Gly Gly Pro Val Leu Cys Pro Arg Gly His Val Val Gly Leu Phe 

1170 1175 1180 

Arg Ala Ala Val Cys Ser Arg Gly Val Ala Lys Ser He Aap Phe He 
1185 1190 1195 1200 

Pro Val Glu Thr Leu Asp Val Val Thr Arg Ser Pro Thr Phe Ser Asp 
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1205 1210 1215 

Asn Ser Tbr Pro Pro Ala Val Pro Gin Thr Tyr Gin Val Gly Tyr Leu 

1220 1225 1230 

His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro Val Ala Tyr 

1235 1240 1245 

Ala Ala Gin Gly Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala 

1250 1255 1260 

Thr Leu Gly Phe Gly Ala Tyr Leu Ser Lys Ala His Gly He Asn Pro 
1265 1270 1275 1280 

Asn He Arg Thr Gly Val Arg Thr Val Met Thr Gly Glu Ala He Thr 

1285 1290 1295 

Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys Ala Ser Gly 

1300 1305 1310 

Ala Tyr Asp He He He Cys Asp Glu Cys His Ala Val Asp Ala Thr 

1315 1320 132b 

Ser He Leu Gly He Gly Thr Val Leu Asp Gin Ala Glu Thr Ala Gly 

1330 1335 1340 

Val Arg Leu Thr Val Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr 

134S 1350 1355 1360 

Thr Pro His Pro Asp He Glu Glu Val Gly Leu Gly Arg Glu Gly Glu 

1365 1370 1375 

He Pro Phe Tyr Gly Arg Ala He Pro Leu Ser Cys He Lys Gly Gly 

1380 1385 1390 

Arg His Leu He Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala 

1395 1400 1405 

Ala Ala Leu Arg Gly Met Gly Leu Asn Ala Val Ala Tyr Tyr Arg Gly 

1410 1415 1420 

Leu Asp Val Ser He He Pro Ala Gin Gly Asp Val Val Val Val Ala 
1425 1430 1435 1440 

Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val He 

1445 14S0 1455 

Asp Cys Asn Val Ala Val Thr Gin Ala Val Asp Phe Ser Leu Asp Pro 
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1460 1465 1470 

Thr Phe Thr lie Thr Thr Gin Thr Val Pro Gin Asp Ala Val Ser Arg 

1475 1480 1485 

Ser Gin Arg Arg Gly Arg Thr Gly Arg Gly Arg Gin Gly Thr Tyr Arg 

1490 1495 1500 

Tyr Val Ser Thr Gly Glu Arg Ala Ser Gly Met Phe Asp Ser Val Val 
1505 1510 1515 1520 

Leu Cys Glu Cys Tyr Asp Ala Gly Ala Ala Trp Tyr Asp Leu Thr Pro 

1525 1530 1535 

Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Phe Asn Thr Pro Gly Leu 

1540 1545 1550 

Pro Val Cys Gin Asp His Leu Glu Phe Trp Glu Ala Val Phe Thr Gly 
1555 1560 1565 

Leu Thr His lie Asp Ala His Phe Leu Ser Gin Thr Lys Gin Ala Gly 

1570 1575 1580 

Glu Asn Phe Ala Tyr Leu Val Ala Tyr Gin Ala Thr Val Cys Ala Arg 
1585 1590 1595 1600 

Ala Lys Ala Pro Pro Pro Ser Trp Asp Ala Met Trp Lys Cys Leu Ala 

1605 1610 1615 

Arg Leu Lys Pro Thr Leu Ala Gly Pro Thr Pro Leu Leu Tyr Arg Leu 

1620 1625 1630 

Gly Pro He Thr Asn Glu Val Thr Leu Thr His Pro Gly Thr Lys Tyr 

1635 1640 1645 

He Ala Thr Cys Met Gin Ala Asp Leu Glu Val Met Thr Ser Thr Trp 

1650 1655 1660 

Val Leu Ala Gly Gly Val Leu Ala Ala Val Ala Ala Tyr Cys Leu Ala 
1665 1670 1675 1680 

Thr Gly Cys Val Ser He He Gly Arg Leu His Val Asn Gin Arg Val 

1685 1690 1695 

Val Val Ala Pro Asp Lys Glu Val Leu Tyr Glu Ala Phe Asp Glu Met 

1700 1705 1710 

Glu Glu Cys Ala Ser Arg Ala Ala Leu He Glu Glu Gly Gin Arg He 
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1715 1720 1725 

Ala Glu Met Leu Lys Ser Lys He Gin Gly Leu Leu Gin Gin Ala Ser 

1730 1735 1740 

Lys Gin Ala Gin Asp He Gin Pro Ala Met Gin Ala Ser Trp Pro Lys 

1745 1750 1755 1760 

Val Glu Gin Phe Trp Ala Arg His Met Trp Asn Phe He Ser Gly He 

1765 1770 1775 

Gin Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala Val Ala 

1780 1785 1790 

Ser Met Met Ala Phe Ser Ala Ala Leu Thr Ser Pro Leu Ser Thr Ser 

1795 1800 1805 

Thr Thr He Leu Leu Asn He Met Gly Gly Trp Leu Ala Ser Gin He 

1810 1815 1820 

Ala Pro Pro Ala Gly Ala Thr Gly Phe Val Val Ser Gly Leu Val Gly 

1825 1830 1835 1840 

Ala Ala Val Gly Ser He Gly Leu Gly Lys Val Leu Val Asp He Leu 

1845 1850 1855 

Ala Gly Tyr Gly Ala Gly He Ser Gly Ala Leu Val Ala Phe Lys He 

1860 1865 1870 

Met Ser Gly Glu Lys Pro Ser Met Glu Asp Val He Asn Leu Leu Pro 

1875 1880 1885 

Gly He Leu Ser Pro Gly Ala Leu Val Val Gly Val He Cys Ala Ala 

1890 1895 1900 

He Leu Arg Arg His Val Gly Pro Gly Glu Gly Ala Val Gin Trp Met 

1905 1910 1915 1920 

Asn Arg Leu He Ala Phe Ala Ser Arg Gly Asn His Val Ala Pro Thr 

1925 1930 1935 

His Tyr Val Thr Glu Ser Asp Ala Ser Gin Arg Val Thr Gin Leu Leu 

1940 1945 1950 

Gly Ser Leu Thr He Thr Ser Leu Leu Arg Arg Leu His Asn Trp He 

1955 1960 1965 

Thr Glu Asp Cys Pro He Pro Cys Ser Gly Ser Trp Leu Arg Asp Val 
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1970 1975 1980 

Trp Asp Trp Val Cys Thr He Leu Thr Asp Phe Lys Asn Trp Leu Thr 
1985 1990 1995 2000 

Ser Lys Leu Phe Pro Lys Leu Pro Gly Leu Pro Phe He Ser Cys Gin 

200S 2010 2015 

Lys Gly Tyr Lys Gly Val Trp Ala Gly Thr Gly He Met Thr Thr Arg 

2020 2025 2030 

Cys Pro Cys Gly Ala Asn He Ser Gly Asn Val Arg Leu Gly Ser Met 

2035 2040 2045 

Arg He Thr Gly Pro Lys Thr Cys Met Asn Thr Trp Gin Gly Thr Phe 

2050 2055 2060 

Pro He Asn Cys Tyr Thr Glu Gly Gin Cys Ala Pro Lys Pro Pro Thr 
2065 2070 2075 2080 

Asn Tyr Lys Thr Ala He Trp Arg Val Ala Ala Ser Glu Tyr Ala Glu 

2085 2090 2095 

Val Thr Gin His Gly Ser Tyr Ser Tyr Val Thr Gly Leu Thr Thr Asp 

2100 210S 2110 

Asn Leu Lys He Pro Cys Gin Leu Pro Ser Pro Glu Phe Phe Ser Trp 

2115 2120 2125 

Val Asp Gly Val Gin He His Arg Phe Ala Pro Thr Pro Ly3 Pro Phe 

2130 2135 2140 

Phe Arg Asp Glu Val Ser Phe Cys Val Gly Leu Asn Ser Tyr Ala Val 
2145 21S0 2155 2160 

Gly Ser Gin Leu Pro Cys Glu Pro Glu Pro Asp Ala Asp Val Leu Arg 

2165 2170 2175 

Ser Met Leu Thr Asp Pro Pro His He Thr Ala Glu Thr Ala Ala Arg 

2180 2185 2190 

Arg Leu Ala Arg Gly Ser Pro Pro Ser Glu Ala Ser Ser Ser Val Ser 

2195 2200 2205 

Gin Leu Ser Ala Pro Ser Leu Arg Ala Thr Cys Thr Thr His Ser Asn 

2210 2215 2220 

Thr Tyr Asp Val Asp Met Val Asp Ala Asn Leu Leu Met Glu Gly Gly 
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2225 2230 2235 2240 

Val Ala Gin Thr Glu Pro Glu Ser Arg Val Pro Val Leu Asp Phe Leu 

2245 22S0 2255 

Glu Pro Met Ala Glu Glu Glu Ser Asp Leu Glu Pro Ser He Pro Ser 

2260 2265 2270 

Glu Cys Met Leu Pro Arg Ser Gly Phe Pro Arg Ala Leu Pro Ala Trp 

2275 2280 2285 

Ala Arg Pro Asp Tyr Asn Pro Pro Leu Val Glu Ser Trp Arg Arg Pro 

2290 2295 2300 

Asp Tyr Gin Pro Pro Thr Val Ala Gly Cys Ala Leu Pro Pro Pro Lys 

2305 2310 2315 2320 

Lys Ala Pro Thr Pro Pro Pro Arg Arg Arg Arg Thr Val Gly Leu Ser 

2325 2330 2335 

Glu Ser Thr He Ser Glu Ala Leu Gin Gin Leu Ala He Lys Thr Phe 

2340 2345 2350 

Gly Gin Pro Pro Ser Ser Gly Asp Ala Gly Ser Ser Thr Gly Ala Gly 

2355 2360 2365 

Ala Ala Glu Ser Gly Gly Pro Thr Ser Pro Gly Glu Pro Ala Pro Ser 

2370 2375 2380 

Glu Thr Gly Ser Ala Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly 
2385 2390 2395 2400 

Asp Pro Asp Leu Glu Ser Asp Gin Val Glu Leu Gin Pro Pro Pro Gin 

2405 2410 2415 

Gly Gly Gly val Ala Pro Gly Ser Gly Ser Gly Ser Trp Ser Thr Cys 

2420 2425 2430 

Ser Glu Glu Asp Asp Thr Thr Val Cys Cys Ser Met Ser Tyr Ser Trp 

2435 2440 2445 

Thr Gly Ala Leu lie Thr Pro Cys Ser Pro Glu Glu Glu Lys Leu Pro 

2450 2455 2460 

He Asn Pro Leu Ser Asn Ser Leu Leu Arg Tyr His Asn Lys Val Tyr 
2465 2470 2475 2480 

Cys Thr Thr Ser Lys Ser Ala Ser Gin Arg Ala Lys Lys Val Thr Phe 
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2485 2490 2495 

Asp Arg Thr Gin Val Leu Asp Ala His Tyr Asp Ser Val Leu Lys Asp 

2500 2505 2510 

He Lys Leu Ala Ala Ser Lys Val Ser Ala Arg Leu Leu Thr Leu Glu 

2515 2520 2525 

Glu Ala Cys Gin Leu Thr Pro Pro His Ser Ala Arg Ser Lys Tyr Gly 

2530 2535 2540 

Phe Gly Ala Lys Glu Val Arg Ser Leu Ser Gly Arg Ala Val Asn His 
2545 25S0 2555 2560 

He Lys Ser Val Trp Lys Asp Leu Leu Glu Asp Pro Gin Thr Pro He 

2565 2570 2575 

Pro Thr Thr He Met Ala Lys Asn Glu Val Phe Cys Val Asp Pro Ala 
2580 2585 2590 

Lys Gly Gly Lys Lys Pro Ala Arg Leu He Val Tyr Pro Asp Leu Gly 

259S 2600 2605 

Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp He Thr Gin Lys Leu 

2610 2615 2620 

Pro Gin Ala Val Met Gly Ala Ser Tyr Gly Phe Gin Tyr Ser Pro Ala 

2625 2630 2635 2640 

Gin Arg Val Glu Tyr Leu Leu Lys Ala Trp Ala Glu Lys Lys Asp Pro 

2645 2650 2655 

Met Gly Phe Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu 

2660 2665 2670 

Arg Asp He Arg Thr Glu Glu Ser He Tyr Gin Ala Cys Ser Leu Pro 

2675 2680 2685 

Glu Glu Ala Arg Thr Ala He His Ser Leu Thr Glu Arg Leu Tyr Val 

2690 2695 2700 

Gly Gly Pro Met Phe Asn Ser Lys Gly Gin Thr Cys Gly Tyr Arg Arg 
2705 2710 2715 2720 

Cys Arg Ala Ser Gly Val Leu Thr Thr Ser Met Gly Asn Thr He Thr 

2725 2730 2735 

Cys Tyr Val Lys Ala Leu Ala Ala Cys Lys Ala Ala Gly He Val Ala 
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2740 2745 2750 

Pro Thr Met Leu Val Cys Gly Asp Asp Leu Val Val He Ser Glu Ser 

2755 2760 2765 

Gin Gly Thr Glu Glu Asp Glu Arg Asn Leu Arg Ala Phe Thr Glu Ala 

2770 2775 2780 

Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Arg Pro Glu Tyr 

2785 2790 2795 2800 

Asp Leu Glu Leu He Thr Ser Cys Ser Ser Asn Val Ser Val Ala Leu 

2805 2810 281S 

Gly Pro Arg Gly Arg Arg Arg Tyr Tyr Leu Thr Arg Asp Pro Thr Thr 

2820 2825 2830 

Pro Leu Ala Arg Ala Ala Trp Glu Thr Val Arg His Ser Pro He Asn 

2835 2840 2845 

Ser Trp Leu Gly Asn He He Gin Tyr Ala Pro Thr He Trp Val Arg 

2850 2855 2860 

Met Val Leu Met Thr His Phe Phe Ser He Leu Met Val Gin Asp Thr 

2865 2870 2875 2880 

Leu Asp Gin Asn Leu Asn Phe Glu Met Tyr Gly Ser Val Tyr Ser Val 

2885 2890 2895 

Asn Pro Leu Asp Leu Pro Ala He He Glu Arg Leu His Gly Leu Asp 

2900 2905 2910 

Ala Phe Ser Met His Thr Tyr Ser His His Glu Leu Thr Arg Val Ala 

2915 2920 2925 

Ser Ala Leu Arg Lys Leu Gly Ala Pro Pro Leu Arg Val Trp Lys Ser 

2930 2935 2940 

Arg Ala Arg Ala Val Arg Ala Ser Leu He Ser Arg Gly Gly Lys Ala 
2945 2950 2955 2960 

Ala Val Cys Gly Arg Tyr Leu Phe Asn Trp Ala Val Lys Thr Lys Leu 

2965 2970 2975 

Lys Leu Thr Pro Leu Pro Glu Ala Arg Leu Leu Asp Leu Ser Ser Trp 

2980 2985 2990 

Phe Thr Val Gly Ala Gly Gly Gly Asp lie Phe His Ser Val Ser Arg 
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2995 3000 30OS 

Ala Arg Pro Arg Ser Leu Leu Phe Gly Leu Leu Leu Leu Phe Val Gly 

3010 301B 3020 

Val Gly Leu Phe Leu Leu Pro Ala Arg 
302S 3030 



<210> S 
<211> 9674 
<212> DNA 

<213> Hepatitis C virus 

<220> 
<221> CDS 

<222> (341) . . (9442) 
<400> 5 

acccgcccct aataggggcg acactccgcc atgaatcact cccctgtgag gaactactgt 60 

cttcacgcag aaagcgtcta gccatggcgt tagtatgagt gtcgtacagc ctccaggccc 120 

ccccctcccg ggagagccat agtggtctgc ggaaccggtg agtacaccgg aattgccggg 180 

aagactgggt cctttcttgg ataaacccac tctatgcccg gccatttggg cgtgcccccg 240 

caagactgct agccgagtag cgttgggttg cgaaaggcct tgtggtactg cctgataggg 300 

tgcttgcgag tgccccggga ggtctcgtag accgtgcacc atg age aca aat ccc 35S 
Met Ser Thr Asn Pro 



aaa cct caa aga aaa acc aaa aga ; 



: act aac cgt cgc cca caa gac 403 
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Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn Arg Arg Pro Gin Asp 



gtt aag ttt ccg ggc ggc ggc cag ate gtt ggc gga gta tac ttg ttg 451 
Val Lys Phe Pro Gly Gly Gly Gin lie Val Gly Gly Val Tyr Leu Leu 



cog cgc agg ggc ccc agg ttg ggt gtg cgc gcg aca agg aag get teg 4 99 
Pro Arg Arg Gly pro Arg Leu Gly Val Arg Ala Thr Arg Lys Ala Ser 



gag egg tec cag cca cgt ggg agg cgc cag ccc ate ccc aaa cat egg 547 
Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro lie Pro Lys His Arg 



cgc tec act ggc aag tec tgg ggg aag cca gga tac ccc tgg ccc ctg 595 
Arg Ser Thr Gly Lys Ser Trp Gly Lys Pro Gly Tyr Pro Trp Pro Leu 



tat ggg aat gag ggg etc ggt tgg gca gga tgg etc ctg tec cct cga 643 

Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp Leu Leu Ser Pro Arg 
90 95 100 

ggt tec cgt ccc tea tgg ggc ccc aat gac ccc egg cat agg teg cgc 691 

Gly Ser Arg Pro Ser Trp Gly Pro Asn Asp Pro Arg His Arg Ser Arg 
105 no 115 

aat gtg ggt aag gtc ate gat acc eta acg tgc ggc ttt gee gac etc 739 

Asn Val Gly Lys Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
120 125 130 

ttg ggg tac gtc ccc gtc gta ggc gee ccg ctt agt ggc gtt gec agt 787 
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Leu Gly Tyr Val Pro Val Val Gly Ala Pro Leu Ser Gly Val Ala Ser 
135 140 145 

get etc gcg cac ggc gtg aga gtc ctg gag gac ggg gtt aat ttt gca 835 
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Phe Ala 
150 155 ISO 165 

aca ggg aac tta cct ggt tgc tec ttt tct ate ttc ttg ctg gec eta 883 
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser lie Phe Leu Leu Ala Leu 
170 175 180 

ctg tec tgc ate act act ccg gtc tct get gtc caa gtg aag aac acc 931 
Leu Ser Cys He Thr Thr Pro Val Ser Ala Val Gin Val Lys Asn Thr 



age aac gee tat atg gcg act aac gac tgt tec aat gac age ate act 979 
Ser Asn Ala Tyr Met Ala Thr Asn Asp Cys Ser Asn Asp Ser He Thr 

200 205 210 

tgg cag ctt gag gec gca gtc etc cat gtc ccc ggg tgc gtc ccg tgc 1027 
Trp Gin Leu Glu Ala Ala Val Leu His Val Pro Gly Cys Val Pro Cys 
215 220 225 

gag aaa atg ggg aac aca tea egg tgc tgg ata cca gtc tea cca aac 107 5 
Glu Lys Met Gly Asn Thr Ser Arg Cys Trp He Pro Val Ser Pro Asn 
230 235 240 245 

gtg get gtg egg cag cct ggc gee etc acg egg ggc ttg egg acg cac 1123 
Val Ala Val Arg Gin Pro Gly Ala Leu Thr Arg Gly Leu Arg Thr His 
250 2S5 260 

ate gac atg gtc gtg ttg tec gec acg etc tgc tec get etc tac gtg 1171 
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He Asp Met Val Val Leu Ser Ala Thr Leu Cys Ser Ala Leu Tyr Val 
265 270 275 

ggg gac etc tgt ggc ggg gtg atg etc gcg tec cag atg ttc att gtc 1219 

Gly Asp Leu Cys Gly Gly Val Met Leu Ala Ser Gin Met Phe He Val 
280 285 290 

teg ccg cag cac cac tgg ttc gtg cag gaa tgc aat tgc tec ate tac 1257 

Ser Pro Gin His His Trp Phe Val Gin Glu Cys Asn Cys Ser He Tyr 
295 300 305 

cct ggc gec ate act ggg cac cgt atg gca tgg gac atg atg atg aac 1315 

Pro Gly Ala He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn 
310 315 320 325 

tgg teg ccc acg ace ace atg ate ctg gcg tac gtg atg cgc gtt ccc 1363 

Trp Ser Pro Thr Thr Thr Met He Leu Ala Tyr Val Met Arg Val Pro 



gag gtc ate ata gac ate att age gga get cac tgg ggc gtc atg ttt 1411 

Glu Val He He Asp lie He Ser Gly Ala His Trp Gly Val Met Phe 
345 350 355 

ggc ctg gee tac ttc tct atg cag gga gcg tgg gcg aag gtc gtt gtc 1459 

Gly Leu Ala Tyr Phe Ser Met Gin Gly Ala Trp Ala Lys Val Val Val 
360 365 370 

ate etc ctg ctg gec tct ggg gtg gac gcg tac ace acc acg act ggg 1507 

He Leu Leu Leu Ala Ser Gly Val Asp Ala Tyr Thr Thr Thr Thr Gly 
375 380 385 

age get get ggg cgc act acc agt age ctg gee age gec ttc tec cct 1555 
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Ser Ala Ala Gly Arg Thr Thr Ser Ser Leu Ala Ser Ala Phe Ser Pro 
390 395 400 405 

ggc get egg cag aac att cag etc att aat acc aat ggt age tgg cac 1603 

Gly Ala Arg Gin Asn He Gin Leu He Asn Thr Asn Gly Ser Trp His 
410 41S 420 

ate aac cgc acc gee ctg aat tgc aac gat tec ttg cac acc ggc ttc 1651 

lie Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu His Thr Gly Phe 
425 430 435 

ttc acg gee ctg ttc tac ate cat aag ttc aac teg teg gga tgt ccc 1699 

Phe Thr Ala Leu Phe Tyr He His Lys Phe Asn Ser Ser Gly Cys Pro 
440 445 450 

gag cgc ctg tec gec tgt cgc aac ate gag gac ttc egg ata gga tgg 174 7 

Glu Arg Leu Ser Ala Cys Arg Asn He Glu Asp Phe Arg He Gly Trp 
455 460 465 

ggc gec ctg caa tac gac gac aat gtc acc aat cca gaa gat atg agg 1795 

Gly Ala Leu Gin Tyr Asp Asp Asn Val Thr Asn Pro Glu Asp Met Arg 
470 475 480 485 

cca tat tgc tgg cac tac cca cca aaa cag tgt ggc gta gtc ccc gca 1843 

Pro Tyr Cys Trp His Tyr Pro Pro Lys Gin Cys Gly Val Val Pro Ala 
490 495 500 

ggg acc gtg tgc ggc cca gtg tac tgt ttc acc cct age ccg gtg gta 1891 

Gly Thr Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val 
505 510 515 

gtg ggc acg acc gat aga ctt gga gtg cct act tac acg tgg gga gag 193 9 
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Val Gly Thr Thr Asp Arg Leu Gly Val Pro Thr Tyr Thr Trp Gly Glu 
520 525 530 

aat gag aca gat gtc ttc eta ttg aac age acc cga cca ccg teg ggg 1987 
Asn Glu Thr Asp Val Phe Leu Leu Asn Ser Thr Arg Pro Pro Ser Gly 
535 540 545 

tea tgg ttt ggc tgc acg tgg atg aac tec act ggc ttc acc aag acc 2035 
Ser Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe Thr Lys Thr 
S50 555 560 565 

tgc ggc gca cca ccc tgc cgc act aga get gac ttc aat acc age aca 208 3 
Cya Gly Ala Pro Pro Cys Arg Thr Arg Ala Asp Phe Asn Thr Ser Thr 
570 575 580 

gat ctg ttg tgc ccc acg gac fcgt ttt aga aaa cat cct gaa gec act 2131 
Asp Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr 

585 590 595 

tac ate aaa tgt ggt tec ggg cct tgg etc acg cca aag tgt ctg gtt 2179 
Tyr lie Lys Cys Gly Ser Gly Pro Trp Leu Thr Pro Lys Cys Leu Val 
600 605 610 

gac tac ccc tac agg etc tgg cat tac cct tgc aca gtc aat tac tec 2227 
Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Tyr Ser 
615 620 625 

acc ttc aag ate agg atg tat gtg ggg gga gtt gag cac agg etc atg 2275 
Thr Phe Lys lie Arg Met Tyr Val Gly Gly Val Glu His Arg Leu Met 
630 635 640 645 

gec gcg tgc aat ttc act cgt ggg gat cgc tgc aac ttg gag gat agg 2323 
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Ala Ala Cya Asn Phe Thr Arg Gly Asp Arg- Cys Asn Leu Glu Asp Arg 
650 655 660 

gac aga agt caa cag act cct ctg ttg cac tec acc acg gaa tgg gec 2371 
Asp Arg Ser Gin Gin Thr Pro Leu Leu His Ser Thr Thr Glu Trp Ala 
665 670 675 

att ttg ccc tgc tct ttc tea gac ttg ccc get ttg teg act ggt ctt 2419 
lie Leu Pro Cys Ser Phe Ser Asp Leu Pro Ala Leu Ser Thr Gly Leu 
680 685 690 

etc cac etc cac caa aat ate gtg gac gta caa tat atg tat ggc ctg 2467 
Leu Hie Leu His Gin Asn lie Val Asp Val Gin Tyr Met Tyr Gly Leu 
695 700 705 

tea cct gee etc aca caa tat ate gtt cga tgg gag tgg gta gta etc 2515 
Ser Pro Ala Leu Thr Gin Tyr lie Val Arg Trp' Glu Trp Val Val Leu 
710 715 720 725 

tta ttc ctg etc eta gcg gac gee agg gtc tgc gee tgc ttg tgg atg 2563 
Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys Ala Cys Leu Trp Met 
730 735 740 

etc ate ttg ctg ggc caa gee gaa gca gca ctg gag aag ctg gtc gtc 2611 
Leu lie Leu Leu Gly Gin Ala Glu Ala Ala Leu Glu Lys Leu Val Val 
745 750 755 

ttg cac get gcg age gca get age tgc aat ggc ttc ctg tat ttt gtc 2659 
Leu His Ala Ala Ser Ala Ala Ser Cys Asn Gly Phe Leu Tyr Phe Val 
760 765 770 

ate ttt etc gtg get get tgg cac ate aag ggt agg gtg gtc ccc ttg 2707 
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He Phe Leu Val Ala Ala Trp His lie Lys Gly Arg Val Val Pro Leu 
775 780 785 

get get tat tec ctt act ggc ctg tgg ccg ttc tgc eta ctg etc eta 2755 
Ala Ala Tyr Ser Leu Thr Gly Leu Trp Pro Phe Cys Leu Leu Leu Leu 
790 795 800 805 

gca ctg ccc cag cag get tac gec tat gat gca tct gtg cac gga cag 2803 
Ala Leu Pro Gin Gin Ala Tyr Ala Tyr Asp Ala Ser Val His Gly Gin 
810 815 820 

gtg ggc gcg get ttg eta gta ctg att acc etc ttt aca etc acc ccg 2851 
Val Gly Ala Ala Leu Leu Val Leu He Thr Leu Phe Thr Leu Thr Pro 
825 830 835 

ggg tat aag acc ctt etc age cag tec ctg tgg tgg ttg tgc tat etc 2899 
Gly Tyr Lys Thr Leu Leu Ser Gin Ser Leu Trp Trp Leu Cys Tyr Leu 
840 845 850 

ctg acc ctg gcg gaa acc atg gtc cag gag tgg gca cca tec atg cag 2947 
Leu Thr Leu Ala Glu Thr Met Val Gin Glu Trp Ala Pro Ser Met Gin 
855 860 865 

gcg cgc ggc ggc cgt gat ggc ate ata tgg gee gee acc ata ttt tgc 2995 
Ala Arg Gly Gly Arg Asp Gly He He Trp Ala Ala Thr He Phe Cys 



ccg ggc gta gtg ttt gac ata acc aag tgg etc tta gcg gtg ctt ggg 
Pro Gly Val Val Phe Asp He Thr Lys Trp Leu Leu Ala Val Leu Gly 



890 



900 



cct ggt tac etc eta aga ggt get ttg acg cgc gtg cca tat ttc gtc 3091 
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Pro Gly Tyr Leu Leu Arg Gly Ala Leu Thr Arg Val Pro Tyr Phe Val 
905 910 915 

aga gcc cac get ctg ctg aga atg tgc act atg gtg agg cac etc gcg 3139 
Arg Ala His Ala Leu Leu Arg Met Cys Thr Met Val Arg His Leu Ala 
920 925 930 

ggg ggt agg tac gtc cag atg gcg eta tta gcc ctt ggc agg tgg act 3187 
Gly Gly Arg Tyr Val Gin Met Ala Leu Leu Ala Leu Gly Arg Trp Thr 
935 940 945 

ggc act tac ate tat gac cac etc acc cct atg teg gat tgg get get 3235 
Gly Thr Tyr He Tyr Asp His Leu Thr Pro Met Ser Asp Trp Ala Ala 
9S0 955 960 965 

age ggc ctg egg gac ttg gcg gtc get gtg gag cct ate ate ttc agt 3283 
Ser Gly Leu Arg Asp Leu Ala Val Ala Val Glu Pro He He Phe Ser 
970 975 980 

ccg atg gag aag aaa gtc ate gtt tgg gga gcg gag acg get gcg tgc 3331 
Pro Met Glu Lys Lys Val He Val Trp Gly Ala Glu Thr Ala Ala Cys 
985 990 995 

ggg gac ate ttg cac gga ctt ccc gtg tec gcc cga etc ggt egg gag 3379 
Gly Asp He Leu His Gly Leu Pro Val Ser Ala Arg Leu Gly Arg Glu 
1000 1005 1010 

ate etc ctt ggc cca get gat ggc tac acc tec aag ggg tgg aag ctt 3427 
He Leu Leu Gly Pro Ala Asp Gly Tyr Thr Ser Lys Gly Trp Lys Leu 
1015 1020 1025 

etc gcc ccc ate acc get tac gcc cag cag aca cga ggt etc ttg ggc 3475 
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Leu Ala Pro He Thr Ala Tyr Ala Gin Gin Thr Arg Gly Leu Leu Gly 
1030 1035 10*0 1045 

tot ata gtg gtg age atg acg ggg cgt gac aag aca gaa cag gee ggg 3523 
Ser lie Val Val Ser Met Thr Gly Arg Asp Lys Thr Glu Gin Ala Gly 
1050 



1055 1060 



gag gte caa gtc ctg tec aca gte act cag tec ttc etc gga aca tec 3571 
Glu Val Gin Val Leu Ser Thr Val Thr Gin Ser Phe Leu Gly Thr Ser 

1070 1075 



att teg ggg gtc tta tgg act gtt tac cae gga get ggc aae aag aea 3 619 
lie Ser Gly Val Leu Trp Thr Val Tyr His Gly Ala Gly Asn Lys Thr 



eta gec ggc teg egg ggc ecg gte acg cag atg tae teg age gee gag 3667 
Leu Ala Gly Ser Arg Gly Pro Val Thr Gin Met Tyr Ser Ser Ala Glu 



ggg gac ttg gtc ggg tgg cce age cet cct ggg aee aaa tet ttg gag 3715 
Gly Asp Leu Val Gly Trp Pro Ser Pro Pro Gly Thr Lys Ser Leu Glu 
1115 H20 ll 25 



ccg tgt acg tgt gga geg gte gac ctg tat ttg gte acg egg aae get 3763 
Pro cys Thr Cys Gly Ala Val Asp Leu Tyr Leu Val Thr Arg Asn Ala 
1130 H35 ll*" 

gat gtc ate ccg get cga aga egc ggg gac aag egg gga gcg ctg etc 3811 
AS p val He Pro Ala Arg Arg Arg Gly Asp Lys Arg Gly Ala Leu Leu 



: ctt teg ace ttg aag ggg tec teg ggg gga cct gtg 3859 
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Ser Pro Arg Pro Leu Ser Thr Leu Lys Gly Ser Ser Gly Sly Pro Val 



1160 



ctt tgc cct agg ggc cac get gtc gga ate ttc egg gca get gtg tgc 3907 
Leu Cys Pro Arg Gly His Ala Val Gly He Phe Arg Ala Ala Val Cys 



1175 



1180 1185 



tet egg ggt gtg get aag tec ata gat ttc ate ccc gtt gag acg etc 3955 
Ser Arg Gly Val Ala Lys Ser lie Asp Phe He Pro Val Glu Thr Leu 
1195 1200 



gac ate gtc acg egg 



ct ccc acc ttt agt gac aac age ■ 



■ Arg Ser Pro Thr Phe Ser Asp Asn Ser Thr Pro Pro 



gct gtg ccc eag acc tat cag gtg ggg tac ttg cac gee ccc act ggc 4051 
Ala val Pro Gin Thr Tyr Gin Val Gly Tyr Leu His Ala Pro Thr Gly 



agt gga aaa age ace aag gtc ccc gtc geg tac gee gee eag ggg tat 4099 
Ser Gly Lys Ser Thr Lys Val Pro Val Ala Tyr Ala Ala Gin Gly Tyr 
1240 1245 1250 

aaa gtg etg gtg etc aat eec teg gtg get gee acc ctg gga ttt ggg 4147 
Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala Thr Leu Gly Phe Gly 



gcg tac ttg tee aag gca cat ggc ate aac ccc aac att agg act gga 4195 



Ala Tyr Leu Ser Lys Ala His Gly He As 



i pro Asn He Arg Thr Gly 



1270 

gtc aga act gtg acg ace ggg gag ccc att aca tae tec acg tat ggt 4243 
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Val Arg Thr val Thr Thr Gly Glu Pro He Thr Tyr Ser Thr Tyr Gly 
1290 1295 1300 

aaa ttc etc gec gat ggg ggc tgc gca ggc ggc gec tat gac ate ate 4291 
Lys Phe Leu Ala Asp Gly Gly Cys Ala Gly Gly Ala Tyr Asp He He 
1305 1310 1315 

ata tgc gat gaa tgc cac tct gtg gat get ace act att etc ggc ate 4339 
He Cys Asp Glu Cys His Ser Val Asp Ala Thr Thr He Leu Gly He 
1320 1325 1330 

ggg aca gtc ctt gac caa gca gag aca gee ggg gtc agg eta act gta 4387 
Gly Thr Val Leu Asp Gin Ala Glu Thr Ala Gly Val Arg Leu Thr Val 
133S 1340 1345 

ctg gec acg gee acg ccc ccc ggg teg gtg aca acc ccc cat ccc aat 4435 
Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr Thr Pro His Pro Asn 
1350 1355 1360 1365 

ata gag gag gta gee etc gga cag gag ggt gag ate ccc ttc tat ggg 4483 
He Glu Glu Val Ala Leu Gly Gin Glu Gly Glu He Pro Phe Tyr Gly 
1370 1375 1380 

agg gcg ttt ccc ctg tct tac ate aag gga ggg agg cac ttg att ttc 4531 
Arg Ala Phe Pro Leu Ser Tyr He Lys Gly Gly Arg His Leu He Phe 
1385 1390 1395 

tgc cac tea aag aaa aag tgt gac gag etc gca acg gee ctt egg ggc 457 9 
Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala Thr Ala Leu Arg Gly 
1400 1405 1410 

atg ggc ttg aac get gtg gca tat tac aga ggg ttg gac gtc tec ata 4627 
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Met Gly Leu Asn Ala Val Ala Tyr Tyr Arg Sly Leu Asp Val Ser lie 
1415 1420 1425 

ata cca act caa gga gat gtg gtg gtc gtt gcc acc gac gcc etc atg 4575 
He Pro Thr Gin Gly Asp Val Val Val Val Ala Thr Asp Ala Leu Met 
1430 1435 1440 1445 

acg ggg tat act gga gac ttt gac tec gtg ate gac tgc aac gta gcg 4723 
Thr Gly Tyr Thr Gly Asp Phe Aap Ser Val He Asp Cys Asn Val Ala 
1450 1455 1460 

gtc acc cag gcc gta gac ttc age ctg gac ccc acc ttc act ata acc 4771 
Val Thr Gin Ala Val Asp Phe Ser Leu Asp Pro Thr Phe Thr He Thr 
1465 1470 1475 

aca cag act gtc ccg caa gac get gtc tea cgt agt cag cgc cga ggg 4819 
Thr Gin Thr Val Pro Gin Asp Ala Val Ser Arg Ser Gin Arg Arg Gly 

1480 1485 1490 

cgc acg ggt aga gga aga ctg ggc att tat agg tat gtt tec act ggt 4867 
Arg Thr Gly Arg Gly Arg Leu Gly He Tyr Arg Tyr Val Ser Thr Gly 
1495 1500 1S05 

gag cga gcc tea gga atg ttt gac agt gta gta etc tgt gag tgc tac 4915 
Glu Arg Ala Ser Gly Met Phe Asp Ser Val Val Leu Cys Glu Cys Tyr 
1510 1515 1S20 1525 

gac gca gga get get tgg tat gag etc tea cca gtg gag acg acc gtc 4 963 
Asp Ala Gly Ala Ala Trp Tyr Glu Leu Ser Pro Val Glu Thr Thr Val 
1530 1535 1540 

agg etc agg gcg tat ttc aac acg cct ggc ttg cct gtg tgc cag gac 5011 
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Arg Leu Arg Ala Tyr Phe Asn Thr Pro Gly Leu Pro Val Cys Gin Asp 
1545 1S50 1555 

cac ctt gag ttt tgg gag gca gtt ttc acc ggc etc aca cac ata gac 5059 
Mis Leu Glu Phe Trp Glu Ala Val Phe Thr Gly Leu Thr His lie Asp 
1560 1565 1570 

get cat ttc ctt tec cag aca aag cag teg ggg gaa aat ttc gca tac 5107 
Ala His Phe Leu Ser Gin Thr Lys Gin Ser Gly Glu Asn Phe Ala Tyr 
1575 1580 1585 

tta gta gec tat cag gec aca gtg tgc gec agg gec aaa gcg ccc ccc 5155 
Leu Val Ala Tyr Gin Ala Thr Val Cys Ala Arg Ala Lys Ala Pro Pro 
1590 1595 1600 1605 

ccg tec tgg gac gtc atg tgg aag tgc ttg act cga etc aag ccc acg 5203 
Pro Ser Trp Asp Val Met Trp Lys Cys Leu Thr Arg Leu Lys Pro Thr 

1610 1615 1620 

ctt gtg ggc cct aca cct etc ctg tac cgt ttg ggc tct gtt acc aac 5251 
Leu Val Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Ser Val Thr Asn 



gag gtc acc ctt aca cac ccc gtg aca aaa tac ate gec aca tgc atg 5299 

Glu Val Thr Leu Thr His Pro Val Thr Lys Tyr He Ala Thr Cys Met 
1640 1645 1650 

caa get gac etc gag gtc atg acc age acg tgg gtc ctg get ggg gga 5347 

Gin Ala Asp Leu Glu Val Met Thr Ser Thr Trp Val Leu Ala Gly Gly 

1655 1660 1665 

gtc tta gca gec gtc gec gcg tat tgc tta gcg acc ggg tgt gtt tec 539 5 
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Val Leu Ala Ala Val Ala Ala Tyr Cys Leu Ala Thr Gly Cya Val Ser 
1670 1675 1680 168B 

ate att ggc cgt tta cac ate aac cag cga get gtc gtc get ccg gac 5443 

lie lie Gly Arg Leu His lie Asn Gin Arg Ala Val Val Ala Pro Asp 
1690 1695 1700 

aag gag gtc etc tat gag get ttt gat gag atg gag gaa tgt gec tec 5491 

Lys Glu Val Leu Tyr Glu Ala Phe Asp Glu Met Glu Glu Cys Ala Ser 
1705 1710 1715 

aga gcg get etc ctt gaa gag ggg cag egg ata gee gag atg ctg aag 553 9 

Arg Ala Ala Leu Leu Glu Glu Gly Gin Arg lie Ala Glu Met Leu Lys 
1720 1725 1730 

tec aag ate caa ggc tta ttg cag caa gee tct aaa cag gec cag gac 5587 

Ser Lys lie Gin Gly Leu Leu Gin Gin Ala Ser Lys Gin Ala Gin Asp 
1735 1740 1745 

ata caa ccc get gtg caa get teg tgg ccc aag atg gag caa ttc tgg 563 5 

He Gin Pro Ala Val Gin Ala Ser Trp Pro Lya Met Glu Gin Phe Trp 
1750 1755 1760 1765 

gee aaa cat atg tgg aac ttc ata age ggc att cag tac etc gca gga 5683 

Ala Lys His Met Trp Asn Phe He Ser Gly lie Gin Tyr Leu Ala Gly 
1770 1775 1780 

ctg tea aca ctg cca ggg aac cct get gtg get tec atg atg gca ttc 5731 

Leu Ser Thr Leu Pro Gly Asn Pro Ala Val Ala Ser Met Met Ala Phe 
1785 1790 1795 

age gec gee etc ace agt ccg ttg tea act age acc ace ate ctt ctt 5779 
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Ser Ala Ala Leu Thr Ser Pro Leu Ser Thr Ser Thr Thr lie Leu Leu 
1800 1805 1810 

aac att ctg ggg ggc tgg ctg gcg tec caa att gcg cca ccc gcg ggg S827 
Asn lie Leu Gly Gly Trp Leu Ala Ser Gin lie Ala Pro Pro Ala Gly 
1815 1820 182S 

gec act ggc ttt gtt gtc agt ggc ctg gtg gga get get gtt ggc age 5875 
Ala Thr Gly Phe Val Val Ser Gly Leu Val Gly Ala Ala Val Gly Ser 
1830 1835 1840 1845 

ata ggc ttg ggt aaa gtg ctg gtg gac ate ctg gca ggg tat ggt gcg 592 3 
He Gly Leu Gly Lys Val Leu Val Asp He Leu Ala Gly Tyr Gly Ala 
1850 1855 I860 

ggc att teg ggg gee etc gtc gcg ttt aag ate atg tct ggc gag aag 5971 
Gly He Ser Gly Ala Leu Val Ala Phe Lys lie Met Ser Gly Glu Lys 
1865 1870 1875 

ccc tec atg gag gat gtc ate aac ttg ctg cct ggg att ctg tct cca 6019 
Pro Ser Met Glu Asp Val He Asn Leu Leu Pro Gly He Leu Ser Pro 
1880 1885 1890 

ggt get ctg gtg gtg gga gtc ate tgc gcg gec att ctg cgc cgc cat 6067 
Gly Ala Leu Val Val Gly Val He Cys Ala Ala He Leu Arg Arg His 
1895 1900 1905 

gtg gga ccg ggg gaa ggc gcg gtc caa tgg atg aac agg ctt ate gee 6115 
Val Gly Pro Gly Glu Gly Ala Val Gin Trp Met Asn Arg Leu He Ala 
1910 1915 1920 1925 

ttc get tec aga gga aac cac gtc gee cct act cac tac gtg acg gag 6163 
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Phe Ala Ser Arg Gly Asn His Val Ala Pro Thr His Tyr Val Thr Glu 
1930 1935 1940 

teg gat gcg teg cag cgt gtc acc caa ctg ctt ggc tct etc act ata 6211 
Ser Asp Ala Ser Gin Arg Val Thr Gin Leu Leu Gly Ser Leu Thr He 
1945 1950 1955 

act agt eta etc agg aga ctt cac aac tgg ate act gag gat tgc ccc 6259 
Thr Ser Leu Leu Arg Arg Leu His Asn Trp He Thr Glu Asp Cys Pro 
I960 1965 1970 

ate cca tgc gec ggc teg tgg etc cgc gat gtg tgg gac tgg gtc tgt 6307 
He Pro Cys Ala Gly Ser Trp Leu Arg Asp Val Trp Asp Trp Val Cys 
1975 1980 ■ 1985 

acc ate eta aca gac ttt aag aac tgg ctg acc tec aag ctg ttc cca 6355 
Thr He Leu Thr Asp Phe Lys Asn Trp Leu Thr Ser Lys Leu Phe Pro 
1990 1995 2000 2005 

aag atg cct ggc etc ccc ttt ate tct tgc caa aag ggg tac aag ggc 6403 
Lys Met Pro Gly Leu Pro Phe He Ser Cys Gin Lys Gly Tyr Lys Gly 
2010 2015 2020 

gtg tgg gec ggc act ggc ate atg acc aca cga tgc ccc tgc ggc gee 6451 
Val Trp Ala Gly Thr Gly He Met Thr Thr Arg Cys Pro Cys Gly Ala 
2025 2030 2035 

aac ate tct ggc aac gtc cgc ttg ggc tct atg aga ate aca gga ccc 64 99 
Asn He Ser Gly Asn Val Arg Leu Gly Ser Met Arg He Thr Gly Pro 
2040 2045 2050 

aaa acc tgc atg aac acc tgg cag ggg acc ttt cct ate aat tgt tat 6547 



123 



JP Application No. 2003-329115 



Lys Thr Cys Met Asn Thr Trp Gin Gly Thr Phe Pro He Asn Cys Tyr 
205B 2060 2065 

aca gaa ggc cag tgc ttg ccg aaa ccc gcg tta aac ttc aag acc gcc 6595 
Thr Glu Gly Gin Cys Leu Pro Lys Pro Ala Leu Asn Phe Lys Thr Ala 
2070 2075 2080 2085 

ate tgg aga gtg gcg gcc tea gag tac gcg gaa gtg acg cag cac gga 6643 
He Trp Arg Val Ala Ala Ser Glu Tyr Ala Glu Val Thr Gin His Gly 
2090 2095 2100 

tea tat gcc tat ata aca ggg ctg acc act gac aac tta aaa gtc cct 6691 
Ser Tyr Ala Tyr He Thr Gly Leu Thr Thr Asp Asn Leu Lys Val Pro 
210S 2110 2115 

tgc caa etc ccc tct cca gag ttt ttc tct tgg gtg gac gga gta caa 6739 
Cys Gin Leu Pro Ser Pro Glu Phe Phe Ser Trp Val Asp Gly Val Gin 

2120 2125 2130 

ate cat agg tec gcc ccc aca cca aag ccg ttt ttc egg gat gag gtc 6787 
He His Arg Ser Ala Pro Thr Pro Lys Pro Phe Phe Arg Asp Glu Val 
2135 2140 2145 

teg ttc age gtt ggg etc aat tea ttt gtc gtc ggg tct cag ctt ccc 6835 
Ser Phe Ser Val Gly Leu Asn Ser Phe Val Val Gly Ser Gin Leu Pro 
2150 2155 2160 2165 

tgt gac cct gag ccc gac act gag gta gtg atg tec atg eta aca gac 6883 
Cys Asp Pro Glu Pro Asp Thr Glu Val Val Met Ser Met Leu Thr Asp 
2170 2175 2180 

cca tec cat ate acg gcg gag get gca gcg egg cgt tta gcg egg ggg 6931 
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Pro Ser His He Thr Ala Glu Ala Ala Ala Arg Arg Leu Ala Arg Gly 
2185 2190 2195 

tea ccc cca tct gag gca age tec tea gcg age cag ctg teg gcg cca 6979 
Ser Pro Pro Ser Glu Ala Ser Ser Ser Ala Ser Gin Leu Ser Ala Pro 
2200 2205 2210 

teg ctg cga gee acc tgc acc acc cac ggt agg acc tat gat gtg gac 7027 
Ser Leu Arg Ala Thr Cys Thr Thr His Gly Arg Thr Tyr Asp Val Asp 
2215 2220 2225 

atg gtg gat gec aac ctg ttc atg ggg ggc ggc gtg att egg ata gag 7075 
Met Val Asp Ala Asn Leu Phe Met Gly Gly Gly Val He Arg He Glu 
2230 2235 2240 2245 

tct gag tec aaa gtg gtc gtt ctg gac tec etc gac tea atg acc gag 7123 
Ser Glu Ser Lys Val Val Val Leu Asp Ser Leu Asp Ser Met Thr Glu 

2250 2255 2260 

gaa gag ggc gac ctt gag cct tea gta cca teg gag tat atg etc ccc 7171 
Glu Glu Gly Asp Leu Glu Pro Ser Val Pro Ser Glu Tyr Met Leu Pro 
2265 2270 2275 

agg aag agg ttc cca ccg gee tta ccg get tgg gcg egg cct gat tac 7219 
Arg Lys Arg Phe Pro Pro Ala Leu Pro Ala Trp Ala Arg Pro Asp Tyr 
2280 2285 2290 

aac cca ccg ctt gtg gaa teg tgg aag agg cca gat tac caa cca ccc 7267 
Asn Pro Pro Leu Val Glu Ser Trp Lys Arg Pro Asp Tyr Gin Pro Pro 
2295 2300 2305 

act gtt gcg ggc tgt get etc ccc ccc ccc aaa aag acc ccg acg cct 7315 
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Thr Val Ala Gly Cys Ala Leu Pro Pro Pro Lys Lys Thr Pro Thr Pro 
2310 2315 2320 2325 

cct cca agg aga cgc egg aca gtg ggt ctg age gag age acc ata gga 7363 
Pro Pro Arg Arg Arg Arg Thr Val Gly Leu Ser Glu Ser Thr He Gly 
2330 2335 2340 

gat gec etc caa cag ctg gec ate aag tec ttt ggc cag ccc ccc cca 7411 
Asp Ala Leu Gin Gin Leu Ala He Lys Ser Phe Gly Gin Pro Pro Pro 
2345 2350 2355 

age ggc gat tea ggc ctt tec acg ggg geg gac gee gee gac tec ggc 7459 
Ser Gly Aap Ser Gly Leu Ser Thr Gly Ala Asp Ala Ala Asp Ser Gly 
2360 2365 2370 

gat egg aca ccc cct gac gag ttg get ctt teg gag aca ggt tct acc 7507 
Asp Arg Thr Pro Pro Asp Glu Leu Ala Leu Ser Glu Thr Gly Ser Thr 
2375 2380 2385 

tec tec atg ccc ccc etc gag ggg gag cct ggg gac cca gac ctg gag 7555 
Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Glu 
2390 2395 2400 2405 

cct gag cag gta gag ctt caa cct cct ccc cag ggg ggg gag gca get 7603 
Pro Glu Gin Val Glu Leu Gin Pro Pro Pro Gin Gly Gly Glu Ala Ala 
2410 2415 2420 

ccc ggc teg gac teg ggg tec tgg tct act tgc tec gag gag gat gac 7651 
Pro Gly Ser Asp Ser Gly Ser Trp Ser Thr Cys Ser Glu Glu Asp Asp 
2425 2430 2435 

tec gtc gtg tgc tgc tec atg tea tat tec tgg acc ggg get eta ata 7699 
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Ser Val Val Cys Cys Ser Met Ser Tyr Ser Trp Thr Gly Ala Leu lie 
2440 2445 2450 

act cct tgt age ccc gaa gag gaa aag ttg cca att aac tec ttg age 774 7 

Thr Pro Cys Ser Pro Glu Glu Glu Lys Leu Pro He Asn Ser Leu Ser 
2455 2460 2465 

aac teg ctg ttg cga tac cat aac aag gta tac tgt act aca tea aag 7795 

Asn Ser Leu Leu Arg Tyr His Asn Lys Val Tyr Cys Thr Thr Ser Lys 

2470 2475 2480 2485 

agt gee tea eta agg get aaa aag gta act ttt gat agg atg caa gtg 7843 

Ser Ala Ser Leu Arg Ala Lys Lys Val Thr phe Asp Arg Met Gin Val 

2490 2495 2500 

etc gac gee tat tat gat tea gtc tta aag gac ate aag eta gcg gee 7891 

Leu Asp Ala Tyr Tyr Asp Ser Val Leu Lys Asp He Lys Leu Ala Ala 
2505 2510 2515 

tec aag gtc age gca agg etc etc acc tta gag gag gcg tgc caa ttg 7939 

Ser Lys Val Ser Ala Arg Leu Leu Thr Leu Glu Glu Ala Cys Gin Leu 
2520 2525 2530 

acc cca ccc cac tct gca aga tec aag tat ggg ttt ggg get aag gag 7987 

Thr Pro Pro His Ser Ala Arg Ser Lys Tyr Gly Phe Gly Ala Lys Glu 
2535 2540 2545 

gtc cgc age ttg tec ggg agg gee gtc aac cac ate aag tec gtg tgg 8035 

Val Arg Ser Leu Ser Gly Arg Ala Val Asn His He Lys Ser Val Trp 

2550 2555 2S60 2565 

aag gac etc ttg gaa gac tea caa aca cca att cct aca acc ate atg 8083 
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Lys Asp Leu Leu Glu Asp Ser Gin Thr Pro lie Pro Thr Thr lie Met 
2570 2575 2580 

gcc aaa aat gag gtg ttc tgc gtg gac ccc gcc aag ggg ggt aaa aaa 813 1 

Ala Lys Asn Glu Val Phe Cys Val Asp Pro Ala Lys Gly Gly Lye Lys 
2585 2590 2595 

cca get cgc ctt ate gtt tac cct gac etc ggc gtc agg gtc tgc gag 8179 

Pro Ala Arg Leu He Val Tyr Pro Asp Leu Gly Val Arg Val Cys Glu 
2600 2605 2610 

aag atg gcc ctt tat gat gtc aca caa aag ctt cct cag gcg gtg atg 8227 

Lye Met Ala Leu Tyr Aap Val Thr Gin Lys Leu Pro Gin Ala Val Met 
2615 2620 2625 

ggg get tct tat ggc ttc cag tac tec ccc get cag egg gtg gag ttt 827 5 

Gly Ala Ser Tyr Gly Phe Gin Tyr Ser Pro Ala Gin Arg Val Glu Phe 

2630 2635 2640 2645 

etc ttg aag gca tgg gcg gaa aag aga gac cct atg ggt ttt teg tat 8323 

Leu Leu Lys Ala Trp Ala Glu Lys Arg Asp Pro Met Gly Phe Ser Tyr 
2650 2655 2660 

gat acc cga tgc ttt gac tea acc gtc act gag aga gac ate agg act 8371 

Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Arg Asp lie Arg Thr 
2665 2670 2675 

gag gag tec ata tac cag gcc tgc tec tta ccc gag gag gcc cga act 8419 

Glu Glu Ser lie Tyr Gin Ala Cys Ser Leu Pro Glu Glu Ala Arg Thr 
2680 2685 2690 

gcc ata cac teg ctg act gag aga etc tat gtg gga ggg ccc atg ttc 8467 
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Ala He His Ser Leu Thr Glu Arg Leu Tyr Val Gly Gly Pro Met Phe 
2695 2700 2705 

aac age aag ggc cag tec tgc ggg tac agg cgt tgc cgc gec age ggg 8515 
Asn Ser Lys Gly Gin Ser Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly 
2710 2715 2720 2725 

gtg ctt acc act agt atg ggg aac acc ate aca tgc tat gta aaa gec 8563 
Val Leu Thr Thr Ser Met Gly Asn Thr He Thr Cys Tyr Val Lys Ala 
2730 2735 2740 

eta gcg get tgc aag get gcg ggg ata att gcg ccc acg atg ctg gta 8611 
Leu Ala Ala Cys Lys Ala Ala Gly He He Ala Pro Thr Met Leu Val 
2745 2750 2755 

tgc ggc gac gac ttg gtc gtc ate tea gaa age cag ggg act gag gag 8S59 
Cys Gly Asp Asp Leu Val Val He Ser Glu Ser Gin Gly Thr Glu Glu 

2760 2765 2770 

gac gag egg aac ctg aga gec ttc acg gag get atg acc agg tat tct 8707 
Asp Glu Arg Asn Leu Arg Ala Phe Thr Glu Ala Met Thr Arg Tyr Ser 
2775 2780 2785 

gec cct cct ggt gac ccc ccc aga ccg gaa tat gac ctg gag eta ata 8755 
Ala Pro Pro Gly Asp Pro Pro Arg Pro Glu Tyr Asp Leu Glu Leu He 
2790 2795 2800 2805 

aca tct tgt tec tea aac gtg tct gtg gca ctt ggc cca cag ggc cgc 8803 
Thr Ser Cys Ser Ser Asn Val Ser Val Ala Leu Gly Pro Gin Gly Arg 
2810 2815 2820 

cgc aga tac tac ctg acc aga gac ccc acc act tea att gec egg get 8851 
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Arg Arg Tyr Tyr Leu Thr Arg Asp Pro Thr Thr Ser He Ala Arg Ala 
2825 2830 2835 

gcc tgg gaa aca gtt aga cac tec cct gtc aat tea tgg ctg gga aac 8899 
Ala Trp Glu Thr Val Arg His Ser Pro Val Asn Ser Trp Leu Gly Asn 
2840 2845 2850 

ate ate cag tac get cca acc ata tgg gtt cgc atg gtc ctg atg aca 8947 
He He Gin Tyr Ala Pro Thr He Trp Val Arg Met Val Leu Met Thr 
2855 28S0 2865 

cac ttc ttc tec att etc atg gcc cag gac acc eta gac cag aac ctt 8995 
His Phe Phe Ser He Leu Met Ala Gin Asp Thr Leu Asp Gin Asn Leu 
2870 2875 2880 2885 

aac ttt gaa atg tac gga teg gtg tac tec gtg agt cct ctg gac etc 904 3 
Asn Phe Glu Met Tyr Gly Ser Val Tyr Ser Val Ser Pro Leu Asp Leu 
2890 2895 2900 

cca gcc ata att gaa agg tta cac ggg ctt gac gcc ttc tct ctg cac 9091 
Pro Ala He He Glu Arg Leu His Gly Leu Asp Ala Phe Ser Leu His 
2905 2910 2915 

aca tac act ccc cac gaa ctg acg egg gtg get tea gcc etc aga aaa 913 9 
Thr Tyr Thr Pro His Glu Leu Thr Arg Val Ala Ser Ala Leu Arg Lys 
2920 2925 2930 

ctt ggg gcg cca ccc etc aga gcg tgg aag agt egg gcg cgt gca gtt 9187 
Leu Gly Ala Pro Pro Leu Arg Ala Trp Lys Ser Arg Ala Arg Ala Val 
2935 2940 2945 

agg gcg tec etc ate tec cgt ggg ggg agg gcg gcc gtt tgc ggt egg 923 5 
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Arg Ala Ser Leu lie Ser Arg Gly Gly Arg Ala Ala Val Cys Gly Arg 
2950 2955 2960 2965 

tac etc ttc aac tgg gcg gtg aag acc aag etc aaa etc act cct ttg 9283 
Tyr Leu Phe Aan Trp Ala Val Lys Thr Lys Leu Lys Leu Thr Pro Leu 
2970 2975 2980 

ccg gag gca cgc etc ctg gat ttg tec agt tgg ttt acc gtc ggc gec 9331 
pro Glu Ala Arg Leu Leu A3p Leu Ser Ser Trp Phe Thr Val Gly Ala 
2985 2990 2995 

ggc ggg ggc gac att tat cac age gtg teg cgt gee cga ccc cgc eta 9379 
Gly Gly Gly Asp He Tyr His Ser Val Ser Arg Ala Arg Pro Arg Leu 
3000 3005 3010 

tta etc ctt age eta etc eta ctt tct gta ggg gta ggc etc ttc eta 9427 
Leu Leu Leu Ser Leu Leu Leu Leu Ser Val Gly Val Gly Leu Phe Leu 

3015 3020 3025 

etc ccc get cga tag agcggcacac attagctaca ctccatagct aactgttcct 9482 

Leu Pro Ala Arg 

3030 

tttttttttt tttttttttt tttttttttt tttttttctt tttttttttt tttccctctt 9542 

tcttcccttc tcatcttatt ctactttctt tcttggtggc tccatcttag ccctggtcac 9602 

ggctagctgt gaaaggtccg tgagcegcat gaetgeagag agtgccgtaa ctggtctctc 9662 
tgcagatcat gt 9674 
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<210> 6 
<211> 3033 
<212> PRT 

<213> Hepatitis C virus 
<400> 6 

Met Ser Thr Asn Pro Lya Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 

15 10 IS 

Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin He Val Gly 

20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 

35 40 45 

Thr Arg Lys Ala Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 

50 55 60 

He Pro Lys His Arg Arg Ser Thr Gly Lys Ser Trp Gly Lys Pro Gly 

65 70 75 80 

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp 

85 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Asn Asp Pro 

100 105 HO 

Arg His Arg Ser Arg Asn Val Gly Lys Val He Asp Thr Leu Thr Cys 

115 120 125 

Gly Phe Ala Asp Leu Leu Gly Tyr Val Pro Val Val Gly Ala Pro Leu 

130 135 140 

Ser Gly Val Ala Ser Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 1SS 160 

Gly Val Asn Phe Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He 

165 170 175 

Phe Leu Leu Ala Leu Leu Ser Cys He Thr Thr Pro Val Ser Ala Val 

180 185 190 

Gin Val Lys Asn Thr Ser Asn Ala Tyr Met Ala Thr Asn Asp Cys Ser 
195 200 205 
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Asn Asp Ser lie Thr Trp Gin Leu Glu Ala Ala Val Leu His Val Pro 

210 215 220 

Gly Cys Val Pro Cys Glu Lys Met Gly Asn Thr Ser Arg Cys Trp lie 
225 230 235 240 

Pro Val Ser Pro Asn Val Ala Val Arg Gin Pro Gly Ala Leu Thr Arg 

245 250 255 

Gly Leu Arg Thr His lie Asp Met Val Val Leu Ser Ala Thr Leu Cys 

250 265 270 

Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Gly Val Met Leu Ala Ser 

275 280 285 

Gin Met Phe He Val Ser Pro Gin His His Trp Phe Val Gin Glu Cys 

290 295 300 

Asn Cys Ser He Tyr Pro Gly Ala He Thr Gly His Arg Met Ala Trp 
305 310 31S 320 

Asp Met Met Met Asn Trp Ser Pro Thr Thr Thr Met He Leu Ala Tyr 

325 330 335 

Val Met Arg Val Pro Glu Val He He Asp He He Ser Gly Ala His 

340 345 350 

Trp Gly Val Met Phe Gly Leu Ala Tyr Phe Ser Met Gin Gly Ala Trp 

355 360 365 

Ala Lys Val Val Val He Leu Leu Leu Ala Ser Gly Val Asp Ala Tyr 

370 375 380 

Thr Thr Thr Thr Gly Ser Ala Ala Gly Arg Thr Thr Ser Ser Leu Ala 
385 390 395 400 

Ser Ala Phe Ser Pro Gly Ala Arg Gin Asn He Gin Leu He Asn Thr 

405 410 415 

Asn Gly Ser Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser 

420 425 430 

Leu His Thr Gly Phe Phe Thr Ala Leu Phe Tyr He His Lys Phe Asn 

435 440 445 

Ser Ser Gly Cys Pro Glu Arg Leu Ser Ala Cys Arg Asn He Glu Asp 
450 455 460 
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Phe Arg He Gly Trp Gly Ala Leu 
455 470 
Pro Glu Asp Met Arg Pro Tyr Cys 
485 

Gly Val Val Pro Ala Gly Thr Val 
500 505 
Pro Ser Pro Val Val Val Gly Thr 

515 520 
Tyr Thr Trp Gly Glu Asn Glu Thr 

530 535 
Arg Pro Pro Ser Gly Ser Trp Phe 
545 550 
Gly Phe Thr Lys Thr Cys Gly Ala 
565 

Phe Asn Thr Ser Thr Asp Leu Leu 
580 585 
His Pro Glu Ala Thr Tyr He Lys 

595 600 
Pro Lys Cys Leu Val Asp Tyr Pro 

610 615 
Thr Val Asn Tyr Ser Thr Phe Lys 
625 630 
Glu His Arg Leu Met Ala Ala Cys 
645 

Asn Leu Glu Asp Arg Asp Arg Ser 
660 665 
Thr Thr Glu Trp Ala He Leu Pro 

675 680 
Leu Ser Thr Gly Leu Leu His Leu 

690 695 
Tyr Met Tyr Gly Leu Ser Pro Ala 
705 710 



Gin Tyr Asp Asp Asn Val Thr Asn 

475 480 
Trp His Tyr Pro Pro Lys Gin Cys 
490 495 
Cys Gly Pro Val Tyr Cys Phe Thr 
510 

Thr Asp Arg Leu Gly Val Pro Thr 
525 

Asp Val Phe Leu Leu Asn Ser Thr 
54 0 

Gly Cys Thr Trp Met Asn Ser Thr 

555 560 
Pro Pro Cys Arg Thr Arg Ala Asp 

570 575 

Cys Pro Thr Asp Cys Phe Arg Lys 
590 

Cys Gly Ser Gly Pro Trp Leu Thr 
60S 

Tyr Arg Leu Trp His Tyr Pro Cys 

620 

He Arg Met Tyr Val Gly Gly Val 

635 640 
Asn Phe Thr Arg Gly Asp Arg Cys 
650 655 
Gin Gin Thr Pro Leu Leu His Ser 
670 

Cys Ser Phe Ser Asp Leu Pro Ala 
685 

His Gin Asn He Val Asp Val Gin 
700 

Leu Thr Gin Tyr He Val Arg Trp 
715 720 
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Glu Trp Val Val Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys 

725 730 735 

Ala Cys Leu Trp Met Leu He Leu Leu Gly Gin Ala Glu Ala Ala Leu 

740 745 750 

Glu Lys Leu Val Val Leu His Ala Ala Ser Ala Ala Ser Cys Asn Gly 

755 760 765 

Phe Leu Tyr Phe Val He Phe Leu Val Ala Ala Trp His He Lys Gly 

770 775 780 

Arg Val Val Pro Leu Ala Ala Tyr Ser Leu Thr Gly Leu Trp Pro Phe 
785 790 795 BOO 

Cys Leu Leu Leu Leu Ala Leu Pro Gin Gin Ala Tyr Ala Tyr Asp Ala 

805 810 815 

Ser Val His Gly Gin Val Gly Ala Ala Leu Leu Val Leu He Thr Leu 

820 825 830 

Phe Thr Leu Thr Pro Gly Tyr Lys Thr Leu Leu Ser Gin Ser Leu Trp 

833 840 845 

Trp Leu Cys Tyr Leu Leu Thr Leu Ala Glu Thr Met Val Gin Glu Trp 

850 855 860 

Ala Pro Ser Met Gin Ala Arg Gly Gly Arg Asp Gly He He Trp Ala 
865 870 875 880 

Ala Thr He Phe Cys Pro Gly Val Val Phe Asp He Thr Lys Trp Leu 

885 890 895 

Leu Ala Val Leu Gly Pro Gly Tyr Leu Leu Arg Gly Ala Leu Thr Arg 

900 905 910 

Val Pro Tyr Phe Val Arg Ala His Ala Leu Leu Arg Met Cys Thr Met 

915 920 925 

Val Arg His Leu Ala Gly Gly Arg Tyr Val Gin Met Ala Leu Leu Ala 

930 935 940 

Leu Gly Arg Trp Thr Gly Thr Tyr He Tyr Asp His Leu Thr Pro Met 
945 950 955 960 

Ser Asp Trp Ala Ala Ser Gly Leu Arg Asp Leu Ala Val Ala Val Glu 
965 970 975 
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Pro He He Phe Ser Pro Met Glu Lys Lys Val He Val Trp Gly Ala 

980 985 990 

Glu Thr Ala Ala Cys Gly Asp He Leu His Gly Leu Pro Val Ser Ala 

995 1000 1005 

Arg Leu Gly Arg Glu He Leu Leu Gly Pro Ala Asp Gly Tyr Thr Ser 

1010 1015 1020 

Lys Gly Trp Lys Leu Leu Ala Pro He Thr Ala Tyr Ala Gin Gin Thr 
1025 1030 1035 1040 

Arg Gly Leu Leu Gly Ser He Val Val Ser Met Thr Gly Arg Asp Lys 

1045 1050 1055 

Thr Glu Gin Ala Gly Glu Val Gin Val Leu Ser Thr Val Thr Gin Ser 

1060 1065 1070 

Phe Leu Gly Thr Ser He Ser Gly Val Leu Trp Thr Val Tyr His Gly 

1075 1080 1085 

Ala Gly Asn Lys Thr Leu Ala Gly Ser Arg Gly Pro Val Thr Gin Met 

1090 1095 1100 

Tyr Ser Ser Ala Glu Gly Asp Leu Val Gly Trp Pro Ser Pro Pro Gly 
1105 1110 HIS 1120 

Thr Lys Ser Leu Glu Pro Cys Thr Cys Gly Ala Val Asp Leu Tyr Leu 

1125 1130 1135 

Val Thr Arg Asn Ala Asp Val He Pro Ala Arg Arg Arg Gly Asp Lys 

1140 1145 1150 

Arg Gly Ala Leu Leu Ser Pro Arg Pro Leu Ser Thr Leu Lys Gly Ser 

1155 1160 1165 

Ser Gly Gly Pro Val Leu Cys Pro Arg Gly His Ala Val Gly He Phe 

1170 1175 1180 

Arg Ala Ala Val Cys Ser Arg Gly Val Ala Lys Ser He Asp Phe He 
1185 1190 1195 1200 

Pro Val Glu Thr Leu Asp He Val Thr Arg Ser Pro Thr Phe Ser Asp 

1205 1210 1215 

Asn Ser Thr Pro Pro Ala Val Pro Gin Thr Tyr Gin val Gly Tyr Leu 
1220 1225 1230 
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His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro Val Ala Tyr 

1235 1240 1245 

Ala Ala Gin Gly Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala 

1250 1255 1260 

Thr Leu Gly Phe Gly Ala Tyr Leu Ser Lys Ala His Gly He Asn Pro 
1265 1270 1275 1280 

Asn He Arg Thr Gly Val Arg Thr Val Thr Thr Gly Glu Pro He Thr 

1285 1290 1295 

Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys Ala Gly Gly 

1300 1305 1310 

Ala Tyr Asp He He He Cys Asp Glu Cys His Ser Val Asp Ala Thr 
1315 1320 1325 

Thr He Leu Gly He Gly Thr Val Leu Asp Gin Ala Glu Thr Ala Gly 

1330 1335 1340 

Val Arg Leu Thr Val Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr 

1345 1350 1355 1360 

Thr Pro His Pro Asn, He Glu Glu Val Ala Leu Gly Gin Glu Gly Glu 

1365 1370 1375 

He Pro Phe Tyr Gly Arg Ala Phe Pro Leu Ser Tyr He Lys Gly Gly 

1380 1385 1390 

Arg His Leu He Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala 

1395 1400 1405 

Thr Ala Leu Arg Gly Met Gly Leu Asn Ala Val Ala Tyr Tyr Arg Gly 

1410 1415 1420 

Leu Asp Val Ser He He Pro Thr Gin Gly Asp Val Val Val Val Ala 
1425 1430 1435 1440 

Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val He 

1445 1450 1455 

Asp Cys Asn Val Ala Val Thr Gin Ala Val Asp Phe Ser Leu Asp Pro 

1460 1465 1470 

Thr Phe Thr He Thr Thr Gin Thr Val Pro Gin Asp Ala Val Ser Arg 
1475 1480 1485 
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Ser Gin Arg Arg Gly Arg Thr Gly Arg Gly Arg Leu Gly He Tyr Arg 

1490 1495 1500 

Tyr Val Ser Thr Gly Glu Arg Ala Ser Gly Met Phe Asp Ser Val Val 
1505 1510 1515 1520 

Leu Cys Glu Cys Tyr Asp Ala Gly Ala Ala Trp Tyr Glu Leu Ser Pro 

1525 1530 1535 

Val Glu Thr Thr Val Arg Leu Arg Ala Tyr Phe Asn Thr Pro Gly Leu 

1540 1545 1550 

Pro Val Cys Gin Asp His Leu Glu Phe Trp Glu Ala Val Phe Thr Gly 

1555 1560 1565 

Leu Thr His He Asp Ala His Phe Leu Ser Gin Thr Lys Gin Ser Gly 

1570 1575 1580 

Glu Asn Phe Ala Tyr Leu Val Ala Tyr Gin Ala Thr Val Cys Ala Arg 
1585 1590 1595 1600 

Ala Lys Ala Pro Pro Pro Ser Trp Asp Val Met Trp Lys Cys Leu Thr 

1605 1610 1615 

Arg Leu Lys Pro Thr Leu Val Gly Pro Thr Pro Leu Leu Tyr Arg Leu 

1620 1625 1630 

Gly Ser Val Thr Asn Glu Val Thr Leu Thr His Pro Val Thr Lys Tyr 

1635 1640 1645 

He Ala Thr Cys Met Gin Ala Asp Leu Glu Val Met Thr Ser Thr Trp 

1650 1655 1660 

Val Leu Ala Gly Gly Val Leu Ala Ala Val Ala Ala Tyr Cys Leu Ala 

1665 1670 1675 1660 

Thr Gly Cys Val Ser He He Gly Arg Leu His He Asn Gin Arg Ala 

1685 1690 1695 

Val Val Ala Pro Asp Lys Glu Val Leu Tyr Glu Ala Phe Asp Glu Met 

1700 1705 1710 

Glu Glu Cys Ala Ser Arg Ala Ala Leu Leu Glu Glu Gly Gin Arg He 

1715 1720 1725 

Ala Glu Met Leu Lys Ser Lys He Gin Gly Leu Leu Gin Gin Ala Ser 
1730 1735 1740 
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Lys Gin Ala Gin Asp lie Gin Pro Ala Val Gin Ala Ser Trp Pro Lys 

1745 1750 1755 1760 

Met Glu Gin Phe Trp Ala Lys His Met Trp Asn Phe He Ser Gly He 

1765 1770 1775 

Gin Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala Val Ala 

1780 1785 1790 

Ser Met Met Ala Phe Ser Ala Ala Leu Thr Ser Pro Leu Ser Thr Ser 

1795 1800 1805 

Thr Thr He Leu Leu Asn He Leu Gly Gly Trp Leu Ala Ser Gin He 

1810 1815 1820 

Ala Pro Pro Ala Gly Ala Thr Gly Phe Val Val Ser Gly Leu Val Gly 
1825 1830 1835 1840 

Ala Ala Val Gly Ser He Gly Leu Gly Lys Val Leu Val Asp He Leu 
1845 1850 1855 

Ala Gly Tyr Gly Ala Gly He Ser Gly Ala Leu Val Ala Phe Lys He 

I860 18S5 1870 

Met Ser Gly Glu Lys Pro Ser Met Glu Asp Val He Asn Leu Leu Pro 

1875 1880 1B85 

Gly He Leu Ser Pro Gly Ala Leu Val Val Gly Val He Cys Ala Ala 

1890 1895 1900 

He Leu Arg Arg His Val Gly Pro Gly Glu Gly Ala Val Gin Trp Met 
1905 1910 1915 1920 

Asn Arg Leu He Ala Phe Ala Ser Arg Gly Asn His Val Ala Pro Thr 

1925 1930 1935 

His Tyr Val Thr Glu Ser Asp Ala Ser Gin Arg Val Thr Gin Leu Leu 

1940 1945 1950 

Gly Ser Leu Thr He Thr Ser Leu Leu Arg Arg Leu His Asn Trp He 

1955 I960 1965 

Thr Glu Asp Cys Pro He Pro Cys Ala Gly Ser Trp Leu Arg Asp Val 

1970 1975 1980 

Trp Asp Trp Val Cys Thr He Leu Thr Asp Phe Lys Asn Trp Leu Thr 
1985 1990 1995 2000 
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Ser Lys Leu Phe Pro Lys Met Pro Gly Leu Pro Phe He Ser Cys Gin 

2005 2010 2015 

Lys Gly Tyr Lys Gly Val Trp Ala Gly Thr Gly He Met Thr Thr Arg 

2020 2025 2030 

Cys Pro Cys Gly Ala Asn He Ser Gly Asn Val Arg Leu Gly Ser Met 

2035 2040 2045 

Arg lie Thr Gly Pro Lys Thr Cys Met Asn Thr Trp Gin Gly Thr Phe 

2050 2055 2060 

Pro He Asn Cys Tyr Thr Glu Gly Gin Cys Leu Pro Lys Pro Ala Leu 
2065 2070 2075 2080 

Asn Phe Lys Thr Ala He Trp Arg Val Ala Ala Ser Glu Tyr Ala Glu 
2085 2090 2095 

Val Thr Gin His Gly Ser Tyr Ala Tyr He Thr Gly Leu Thr Thr Asp 

2100 2105 2110 

Asn Leu Lys Val Pro Cys Gin Leu Pro Ser Pro Glu Phe Phe Ser Trp 

2115 2120 2125 

Val Asp Gly Val Gin He His Arg Ser Ala Pro Thr Pro Lys Pro Phe 

2130 2135 2140 

Phe Arg Asp Glu Val Ser Phe Ser Val Gly Leu Asn Ser Phe Val Val 

2145 2150 2155 2160 

Gly Ser Gin Leu Pro Cys Asp Pro Glu Pro Asp Thr Glu Val Val Met 

2165 2170 2175 

Ser Met Leu Thr Asp Pro Ser His He Thr Ala Glu Ala Ala Ala Arg 

2180 2185 2190 

Arg Leu Ala Arg Gly Ser Pro Pro Ser Glu Ala Ser Ser Ser Ala Ser 

2195 2200 2205 

Gin Leu Ser Ala Pro Ser Leu Arg Ala Thr Cys Thr Thr His Gly Arg 

2210 2215 2220 

Thr Tyr Asp Val Asp Met Val Asp Ala Asn Leu Phe Met Gly Gly Gly 

2225 2230 2235 2240 

Val He Arg He Glu Ser Glu Ser Lys Val Val Val Leu Asp Ser Leu 

2245 2250 2255 
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Asp Ser Met Thr Glu Glu Glu Gly Asp Leu Glu Pro Ser Val Pro Ser 

2260 2265 2270 

Glu Tyr Met Leu Pro Arg Lys Arg Phe Pro Pro Ala Leu Pro Ala Trp 

2275 2280 2285 

Ala Arg Pro Asp Tyr Asn Pro Pro Leu Val Glu Ser Trp Lys Arg Pro 

2290 2295 2300 

Asp Tyr Gin Pro Pro Thr Val Ala Gly Cys Ala Leu Pro Pro Pro Lys 

2305 2310 2315 2320 

Lys Thr Pro Thr Pro Pro Pro Arg Arg Arg Arg Thr Val Gly Leu Ser 

2325 2330 2335 

Glu Ser Thr He Gly Asp Ala Leu Gin Gin Leu Ala He Lys Ser Phe 

2340 2345 2350 

Gly Gin Pro Pro Pro Ser Gly Asp Ser Gly Leu Ser Thr Gly Ala Asp 

2355 2360 2365 

Ala Ala Asp Ser Gly Asp Arg Thr Pro Pro Asp Glu Leu Ala Leu Ser 

2370 2375 2380 

Glu Thr Gly Ser Thr Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly 

2385 2390 2395 2400 

Asp Pro Asp Leu Glu Pro Glu Gin Val Glu Leu Gin Pro Pro Pro Gin 

2405 2410 2415 

Gly Gly Glu Ala Ala Pro Gly Ser Asp Ser Gly Ser Trp Ser Thr Cys 

2420 2425 2430 

Ser Glu Glu Asp Asp Ser Val Val Cys Cys Ser Met Ser Tyr Ser Trp 

2435 2440 2445 

Thr Gly Ala Leu He Thr Pro Cys Ser Pro Glu Glu Glu Lys Leu Pro 

2450 2455 2460 

He Asn Ser Leu Ser Asn Ser Leu Leu Arg Tyr His Asn Lys Val Tyr 

2465 2470 2475 2480 

Cys Thr Thr Ser Lys Ser Ala Ser Leu Arg Ala Lys Lys Val Thr Phe 

2485 2490 2495 

Asp Arg Met Gin Val Leu Asp Ala Tyr Tyr Asp Ser Val Leu Lys Asp 

2500 2505 2510 
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He Lys Ley Ala Ala Ser Lys Val Ser Ala Arg Leu Leu Thr Leu Glu 

2515 2520 2525 

Glu Ala Cys Gin Leu Thr Pro Pro His Ser Ala Arg Ser Lys Tyr Gly 

2530 2535 2540 

Phe Gly Ala Lys Glu Val Arg Ser Leu Ser Gly Arg Ala Val Asn His 
2545 2550 2555 2560 

He Lys Ser Val Trp Lys Asp Leu Leu Glu Asp Ser Gin Thr Pro lie 

2565 2570 2575 

Pro Thr Thr He Met Ala Lys Asn Glu Val Phe Cys Val Asp Pro Ala 

2580 2585 2590 

Lys Gly Gly Lys Lys Pro Ala Arg Leu He Val Tyr Pro Asp Leu Gly 

2595 2600 2605 

Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Val Thr Gin Lys Leu 

2610 2615 2620 

Pro Gin Ala Val Met Gly Ala Ser Tyr Gly Phe Gin Tyr Ser Pro Ala 
2625 2630 2635 2640 

Gin Arg val Glu Phe Leu Leu Lys Ala Trp Ala Glu Lys Arg Asp Pro 

2645 2650 2655 

Met Gly Phe Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu 

2660 2665 2670 

Arg Asp He Arg Thr Glu Glu Ser He Tyr Gin Ala Cys Ser Leu Pro 

2675 2680 2685 

Glu Glu Ala Arg Thr Ala He His Ser Leu Thr Glu Arg Leu Tyr Val 

2690 2695 2700 

Gly Gly Pro Met Phe Asn Ser Lys Gly Gin Ser Cyg Gly Tyr Arg Arg 
2705 2710 2715 2720 

Cys Arg Ala Ser Gly Val Leu Thr Thr Ser Met Gly Asn Thr He Thr 

2725 2730 2735 

Cys Tyr Val Lys Ala Leu Ala Ala Cys Lys Ala Ala Gly He He Ala 

2740 2745 2750 

Pro Thr Met Leu Val Cys Gly Asp Asp Leu Val Val He Ser Glu Ser 
2755 2760 2765 
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Gin Gly Thr Glu Glu Asp Glu Arg Asn Leu Arg Ala Phe Thr Glu Ala 

2770 2775 2780 

Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Arg Pro Glu Tyr 
278S 2790 2795 2800 

Asp Leu Glu Leu lie Thr Ser Cys Ser Ser Asn Val Ser Val Ala Leu 

2805 2810 2815 

Gly Pro Gin Gly Arg Arg Arg Tyr Tyr Leu Thr Arg Asp Pro Thr Thr 

2820 2825 2830 

Ser lie Ala Arg Ala Ala Trp Glu Thr Val Arg His Ser Pro Val Aan 

2835 2840 2845 

Ser Trp Leu Gly Asn lie lie Gin Tyr Ala Pro Thr lie Trp Val Arg 

2850 2855 2860 

Met Val Leu Met Thr His Phe Phe Ser He Leu Met Ala Gin Asp Thr 
2855 2870 2875 2880 

Leu Asp Gin Asn Leu Asn Phe Glu Met Tyr Gly Ser Val Tyr Ser Val 

2885 2 830 2895 

Ser Pro Leu Asp Leu Pro Ala He He Glu Arg Leu His Gly Leu Asp 

2900 2905 2910 

Ala Phe Ser Leu His Thr Tyr Thr Pro His Glu Leu Thr Arg Val Ala 

2915 2920 2925 

Ser Ala Leu Arg Lys Leu Gly Ala Pro Pro Leu Arg Ala Trp Lys Ser 

2930 2935 2940 

Arg Ala Arg Ala Val Arg Ala Ser Leu He Ser Arg Gly Gly Arg Ala 
2945 2950 2955 2960 

Ala Val Cys Gly Arg Tyr Leu Phe Asn Trp Ala Val Lys Thr Lys Leu 

2965 2970 2975 

Lys Leu Thr Pro Leu Pro Glu Ala Arg Leu Leu Asp Leu Ser Ser Trp 

2980 2985 2990 

Phe Thr Val Gly Ala Gly Gly Gly Asp He Tyr His Ser Val Ser Arg 

2995 3000 3005 

Ala Arg Pro Arg Leu Leu Leu Leu Ser Leu Leu Leu Leu Ser Val Gly 
3010 3015 3020 
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Val Gly Leu Phe Leu Leu Pro Ala Arg 
3025 3030 



<210> 7 
<211> 8024 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: replicon 
<400> 7 

accugccccu aauaggggcg acacuccgcc augaaucacu ccccugugag gaacuacugu 60 
cuucacgcag aaagcgccua gccauggcgu uaguaugagu gucguacagc cuccaggccc 12 0 
cccccucccg ggagagccau aguggucugc ggaaccggug aguacaccgg aauugccggg 180 
aagacugggu ccuuucuugg auaaacccac ucuaugcccg gccauuuggg cgugcccccg 240 
caagacugcu agccgaguag cguuggguug cgaaaggccu ugugguacug ccugauaggg 300 
cgcuugcgag ugccccggga ggucucguag accgugcacc augagcacaa auccuaaacc 360 
ucaaagaaaa accaaaagaa acaccaaccg ucgcccaaug auugaacaag auggauugca 42 0 
cgcagguucu ccggccgcuu ggguggagag gcuauucggc uaugacuggg cacaacagac 480 
aaucggcugc ucugaugccg ccguguuccg gcugucagcg caggggcgcc cgguucuuuu 54 0 
ugucaagacc gaccuguccg gugcccugaa ugaacugcag gacgaggcag cgcggcuauc 600 
guggcuggcc acgacgggcg uuccuugcgc agcugugcuc gacguuguca cugaagcggg 660 
aagggacugg cugcuauugg gcgaagugcc ggggcaggau cuccugucau cucaccuugc 72 0 
uccugccgag aaaguaucca ucauggcuga ugcaaugcgg cggcugcaua cgcuugaucc 78 0 
ggcuaccugc ccauucgacc accaagcgaa acaucgcauc gagcgagcac guacucggau 84 0 
ggaagccggu cuugucgauc aggaugaucu ggacgaagag caucaggggc ucgcgccagc 900 
cgaacuguuc gccaggcuca aggcgcgcau gcccgacggc gaggaucucg ucgugaccca 960 
uggcgaugcc ugcuugccga auaucauggu ggaaaauggc cgcuuuucug gauucaucga 1020 
cuguggccgg cugggugugg cggaccgcua ucaggacaua gcguuggcua cccgugauau 108 0 
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ugcugaagag cuuggcggcg aaugggcuga ccgcuuccuc gugcuuuacg guaucgccgc 114 0 
uoccgauucg cagcgcaucg ccuucuaucg ccuucuugac gaguucuucu gaguuuaaac 1200 
ccucucccuc cccccccccu aacguuacug gccgaagccg cuuggaauaa ggccggugug 1260 
cguuugucua uauguuauuu uccaccauau ugccgucuuu uggcaaugug agggcccgga 1320 
aaccuggccc ugucuucuug acgagcauuc cuaggggucu uuccccucuc gccaaaggaa 13 80 
ugcaaggucu guugaauguc gugaaggaag caguuccucu ggaagcuucu ugaagacaaa 1440 
caacgucugu agcgacccuu ugcaggcagc ggaacccccc accuggcgac aggugccucu 1SO0 
gcggccaaaa gccacgugua uaagauacac cugcaaaggc ggcacaaccc cagugccacg 1560 
uugugaguug gauaguugug gaaagaguca aauggcucuc cuoaagcgua uucaacaagg 1620 
ggcugaagga ugcccagaag guaccccauu guaugggauc ugaucugggg ccucggugca 1680 
oaugcuuuac auguguuuag ucgagguuaa aaaaacgucu aggccccccg aaccacgggg 1740 
acgugguuuu ccuuugaaaa acacgaugau accauggcuc ccaucacugc uuaugcccag 18 00 
caaacacgag gccuccuggg cgccauagug gugaguauga cggggcguga caggacagaa 1860 
caggccgggg aaguccaaau ccuguccaca gucucucagu ccuuccucgg aacaaccauc 1920 
ucggggguuu uguggacugu uuaccacgga gcuggcaaca agacucuago cggcuuacgg 1980 
gguccgguca cgcagaugua cucgagugcu gagggggacu ugguaggcug gcccagcccc 2040 
ccugggacca agucuuugga gccgugcaag uguggagccg ucgaccuaua ucuggucacg 2100 
cggaacgcug augucauccc ggcucggaga cgcggggaca agcggggagc auugcucuoo 2160 
ccgagaccca uuucgaccuvt gaaggggucc ucgggggggc cggugcucug cccuaggggc 2220 
cacgucguug ggcucuuccg agcagcugug ugcucucggg gcguggccaa auccaucgau 2280 
uucauccccg uugagacacu cgacguuguu acaaggucuc ccacuuuoag ugacaacagc 2340 
acgccaccgg cugugcccca gaccuaucag gucggguacu ugcaugcucc aacuggcagu 2400 
ggaaagagca ccaagguccc ugucgcguau gccgcccagg gguacaaagu acuagugcuu 24 60 
aaccccucgg uagcugccac ccugggguuu ggggcguacc uauccaaggc aoauggcauc 2520 
aaucccaaca uuaggacugg agucaggacc gugaugaccg gggaggccau cacguacucc 2580 
acauauggca aauuucucgc cgaugggggc ugcgcuagcg gcgccuauga caucaucaua 2 64 0 
ugcgaugaau gccacgcugu ggaugcuacc uccauucucg gcaucggaac gguccuugau 27 0 0 
caagcagaga cagccggggu cagacuaacu gugcuggcua cggccacacc ccccggguca 2760 
gugacaaccc cccaucccga uauagaaaag guaggccucg ggcgggaggg ugagaucccc 282 0 
uucuauggga gggcgauucc ccuauccugc aucaagggag ggagacaccu gauuuucugc 28 80 
cacucaaaga aaaaguguga cgagcucgcg gcggcccuuc ggggcauggg cuugaaugcc 294 0 
guggcauacu auagaggguu ggacguouco auaauaccag cucagggaga uguggugguc 3000 
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gucgccaccg acgcccucau gacgggguac acuggagacu uugacuccgu gaucgacugc 3 06 0 
aauguagcgg ucacccaagc ugucgacuuc agccuggacc ccaccuucac uauaaccaca 3120 
cagacugucc cacaagacgc ugucucacgc agucagcgcc gcgggcgcac agguagagga 3180 
agacagggca cuuauaggua uguuuccacu ggugaacgag ccucaggaau guuugacagu 3 24 0 
guagugcuuu gugagugcua cgacgcaggg gcugcguggu acgaucucac accagcggag 3 3 0 0 
accaccguca ggcuuagagc guauuucaac acgcccggcc uacccgugug ucaagaccau 3360 
cuugaauuuu gggaggcagu uuucaccggc cucacacaca uagacgccca cuuccucucc 3420 
caaacaaagc aagcggggga gaacuucgcg uaccuaguag ccuaccaagc uacggugugc 3480 
gccagagcca aggccccucc cccguccugg gacgccaugu ggaagugccu ggcccgacuc 3S40 
aagccuacgc uugcgggccc cacaccucuc cuguaccguu ugggcccuau uaccaaugag 3600 
gucacccuca cacacccugg gacgaaguac aucgccacau gcaugcaagc ugaccuugag 3 660 
gucaugacca gcacgugggu ccuagcugga ggaguccugg cagccgucgc cgcauauugc 3720 
cuggcgacug gaugcguuuc caucaucggc cgcuugcacg ucaaccagcg agucgucguu 3780 
gcgccggaua aggagguccu guaugaggcu uuugaugaga uggaggaaug cgccucuagg 3840 
gcggcucuca ucgaagaggg gcagcggaua gccgagaugu ugaaguccaa gauccaaggc 3900 
uugcugcagc aggccucuaa gcaggcccag gacauacaac ccgcuaugca ggcuucaugg 39S0 
cccaaagugg aacaauuuug ggccagaoac auguggaacu ucauuagcgg cauccaauac 4 02 0 
cucgcaggau ugucaacacu gccagggaac cccgcggugg cuuccaugau ggcauucagu 4080 
gccgcccuca ccaguccguu gucgaccagu accaccaucc uucucaacau caugggaggc 4140 
ugguuagcgu cccagaucgc accacccgcg ggggccaccg gctiuugucgu caguggccug 4200 
gugggggcug ccgugggcag cauaggccug gguaaggugc ugguggacau ccuggcagga 4260 
uauggugcgg gcauuucggg ggcccucguc gcauucaaga ucaugucugg cgagaagccc 4320 
ucuauggaag augucaucaa ucuacugccu gggauccugu cuccgggagc ccugguggug 4380 
ggggucaucu gcgcggccau ucugcgccgc cacgugggac cgggggaggg cgcgguccaa 4440 
uggaugaaca ggcuuauugc cuuugcuucc agaggaaacc acgucgcccc uacucacuac 4500 
gugacggagu cggaugcguc gcagcgugug acccaacuac uuggcucucu uacuauaacc 4560 
agccuacuca gaagacucca caauuggaua acugaggacu gccccauccc augcuccgga 4620 
uccuggcucc gcgacgugug ggacuggguu ugcaccaucu ugacagacuu caaaaauugg 4680 
cugaccucua aauuguuccc caagcugccc ggccuccccu ucaucucuug ucaaaagggg 4740 
uacaagggug ugugggccgg cacuggcauc augaccacgc gcugcccuug cggcgccaac 4800 
aucuouggca auguccgccu gggcucuaug aggaucacag ggccuaaaac cugcaugaac 4860 
accuggcagg ggaccuuucc uaucaauugc uacacggagg gccagugcgc gccgaaaccc 4920 
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cccacgaacu acaagaccgc caucuggagg guggcggccu cggaguacgc ggaggugacg 4980 
cagcaugggu cguacuccua uguaacagga cugaccacug acaaucugaa aauuccuugc 504 0 
caacuaccuu cuccagaguu uuucuccugg guggacggug ugcagaucca uagguuugca 5100 
cccacaccaa agccguuuuu ccgggaugag gucucguucu gcguugggcu uaauuccuau 5160 
gcugucgggu cccagcuucc cugugaaccu gagcccgacg cagacguauu gagguccaug 5220 
cuaacagauc cgccccacau cacggcggag acugcggcgc ggcgcuuggc acggggauca 5280 
ccuccaucug aggcgagcuc cucagugagc cagcuaucag caccgucgcu gcgggccacc 5340 
ugcaccaccc acagcaacac cuaugacgug gacauggucg augccaaccu gcucauggag 54 00 
ggcggugugg cucagacaga gccugagucc agggugcccg uucuggacuu ucucgagcca 5460 
auggccgagg aagagagcga ccuugagccc ucaauaccau cggagugcau gcuccccagg 552 0 
agcggguuuc cacgggccuu accggcuugg gcacggccug acuacaaccc gccgcucgug 5580 
gaaucgugga ggaggccaga uuaccaaccg cccaccguug cugguugugc ucuccccccc 5640 
cccaagaagg ccccgacgcc ucccccaagg agacgccgga cagugggucu gagcgagagc 57 00 
accauaucag aagcccucca gcaacuggcc aucaagaccu uuggccagcc ccccucgagc 5760 
ggugaugcag gcucguccac gggggcgggc gccgccgaau ccggcggucc gacguccccu 5820 
ggugagccgg cccccucaga gacagguucc gccuccucua ugcccccocu cgagggggag 5880 
ccuggagauc cggaccugga gucugaucag guagagcuuc aaccuccccc ccaggggggg 5 94 0 
gggguagcuc ccgguucggg cucggggucu uggucuacuu gcuccgagga ggacgauacc 6000 
accgugugcu gcuccauguc auacuccugg accggggcuc uaauaacucc cuguagcccc 6060 
gaagaggaaa aguugccaau caacccuuug aguaacucgc uguugcgaua ccauaacaag 6120 
guguacugua caacaucaaa gagcgccuca cagagggcua aaaagguaac uuuugacagg 6180 
acgcaagugc ucgacgccca uuaugacuca gucuuaaagg acaucaagcu agcggcuucc 624 0 
aaggucagcg caaggcuccu caccuuggag gaggcgugcc agxiugacucc accccauucu 6300 
gcaagaucca aguauggauu cggggccaag gagguccgca gcuuguccgg gagggccguu 6360 
aaccacauca aguccgugug gaaggaccuc cuggaagacc cacaaacacc aauucccaca 6420 
accaucaugg ccaaaaauga gguguucugc guggaccccg ccaagggggg uaagaaacca 6480 
gcucgccuca ucguuuaccc ugaccucggc guccgggucu gcgagaaaau ggcccucuau 6540 
gacauuacac aaaagcuucc ucaggcggua augggagcuu ccuauggcuu ccaguacucc 6600 
ccugcccaac ggguggagua ucucuugaaa gcaugggcgg aaaagaagga ccccaugggu 6660 
uuuucguaug auacccgaug cuucgacuca accgucacug agagagacau caggaccgag 67 2 0 
gaguccauau aocaggccug cucccugccc gaggaggccc gcacugccau acacucgcug 6780 
acugagagac uuuacguagg agggcccaug uucaacagca agggucaaac cugcgguuac 6840 



147 



JP Application No. 2003-329115 



agacguugcc gcgccagcgg ggugcuaacc acuagcaugg guaacaccau cacaugcuau 69 00 
gugaaagccc uagcggccug caaggcugcg gggauaguug cgcccacaau gcugguaugc 6 96 0 
ggcaaugacc uaguagucau cucagaaagc caggggacug aggaggacga gcggaaccug 7 02 0 
agagccuuca cggaggccau gaccagguac ucugccccuc cuggugaucc ccccagaccg 7080 
gaauaugacc uggagcu&au aacauccugu uccucaaaug ugucuguggc guugggcccg 714 0 
cggggccgcc gcagauacua ccugaccaga gacccaacca cuccacucgc ccgggcugcc 72 00 
ugggaaacag uuagacacuc cccuaucaau ucauggcugg gaaacaucau ccaguaugcu 72 60 
ccaaccauau ggguucgcau gguccuaaug acacacuucu ucuccauucu caugguccaa 732 0 
gacacccugg accagaaccu caacuuugag auguauggau caguauacuc cgugaauccu 73 80 
uuggaccuuc cagccauaau ugagagguua cacgggcuug acgccuuuuc uaugcacaca 744 0 
uacucucacc acgaacugac gcggguggcu ucagcccuca gaaaacuugg ggcgccaccc 7500 
cucagggugu ggaagagucg ggcucgcgca gucagggcgu cccucaucuc ccguggaggg 7560 
aaagcggccg uuugcggccg auaucucuuc aauugggcgg ugaagaccaa gcucaaacuc 7620 
acuccauugc cggaggcgcg ccuacuggac uuauccaguu gguucaccgu cggcgccggc 7680 
gggggcgaca uuuuucacag cguguogcgc gcccgacccc gcucauuacu cuucggccua 774 0 
cuccuacuuu ucguaggggu aggccucuuc cuacuccccg cucgguagag cggcacacac 7800 
uagguacacu ccauagcuaa cuguuccuuu uuuuuuuuuu uuuuuuuuuu uuuuuuuuuu 7860 
uuuuuuuuuu cuuuuuuuuu uuuuucccuc uuucuucccu ucucaucuua uucuacuuuc 7 92 0 
uuucuuggug gcuccaucuu agcccuaguc acggcuagcu gugaaagguc cgugagccgc 7980 
augacugcag agagugccgu aacuggucuc ucugcagauc augu 8024 



<210> 8 
<211> 7994 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: replicon 
<400> 8 

accugccccu aauaggggcg acacuccgcc augaaucacu ccccugugag gaacuacugu 60 
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cuucacgcag aaagcgccua gccauggcgu 
cccccucccg ggagagccau aguggucugc 
aagacugggu ccuuucuugg auaaacccac 
caagacugcu agccgaguag cguuggguug 
cgcuugcgag ugccccggga ggucucguag 
ucaaagaaaa accaaaagaa acaccaaccg 
cgcagguucu ccggccgcuu ggguggagag 
aaucggcugc ucugaugccg ccguguuccg 
ugucaagacc gaccuguccg gugcccugaa 
guggcuggcc acgacgggcg uuccuugcgc 
aagggacugg cugcuauugg gcgaagugcc 
uccugccgag aaaguaucca ucauggouga 
ggcuaccugc ccauucgacc accaagcgaa 
ggaagccggu ouugucgauc aggaugaucu 
cgaacuguuc gccaggcuca aggcgcgcau 
uggcgaugcc ugcuugccga auaucauggu 
cuguggccgg cugggugugg cggaccgcua 
ugcugaagag cuuggcggcg aaugggcuga 
ucccgauucg cagcgcaucg ccuucuaucg 
ccucucccuc cccccccccu aacguuacug 
cguuugucua uauguuauuu uccaccauau 
aaccuggccc ugucuucuug acgagcauuc 
ugcaaggucu guugaauguc gugaaggaag 
caacgucugu agcgacccuu ugcaggcagc 
gcggccaaaa gccacgugua uaagauacac 
uugugaguug gauaguugug gaaagaguca 
ggcugaagga ugcccagaag guaccccauu 
caugcuuuac auguguuuag ucgagguuaa 
acgugguuuu ccuuugaaaa acacgaugau 
caaacacgag gccuccuggg cgccauagug 
caggccgggg aaguccaaau ccuguccaca 
ucggggguuu uguggacugu uuaccacgga 



uaguaugagu gucguacagc cuccaggccc 12 0 
ggaaccggug aguacaccgg aauugccggg 18 0 
ucuaugcccg gccauuuggg cgugcccccg 24 0 
cgaaaggccu ugugguacug ccugauaggg 3 00 
accgugcacc augagcacaa auccuaaacc 360 
ucgcccaaug auugaacaag auggauugca 420 
gcuauucggc uaugacuggg cacaacagac 48 0 
gcugucagcg caggggcgcc cgguucuuuu 54 0 
ugaacugcag gacgaggcag cgcggcuauc 600 
agcugugcuc gacguuguca cugaagcggg 560 
ggggcaggau cuccugucau cucaccuugc 72 0 
ugcaaugcgg cggcugcaua cgcuugaucc 780 
acaucgcauc gagcgagcac guaeucggau 84 0 
ggacgaagag caucaggggc ucgcgccagc 900 
gcccgacggc gaggaucucg ucgugaccca 96 0 
ggaaaauggc cgcuuuucug gauucaucga 1020 
ucaggacaua gcguuggcua cccgugauau 10 8 0 
ccgcuuccuc gugcuuuacg guaucgccgc 114 0 
ccuucuugac gaguucuucu gaguuuaaac 12 0 0 
gccgaagccg cuuggaauaa ggccggugug 1260 
ugccgucuuu uggcaaugug agggcccgga 1320 
cuaggggucu uuccccucuc gccaaaggaa 1380 
caguuccucu ggaagcuucu ugaagacaaa 14 4 0 
ggaacccccc accuggcgac aggugccucu 1500 
cugcaaaggc ggcacaaccc cagugccacg 1560 
aauggcucuc cucaagcgua uucaacaagg 162 0 
guaugggauc ugaucugggg ccucggugca 1680 
aaaaacgucu aggccccccg aaccacgggg 1740 
accauggcuc ccaucacugc uuaugcccag 1800 
gugaguauga cggggcguga caggacagaa 1860 
gucucucagu ccuuccucgg aacaaccauc 1920 
gcuggcaaca agacucuagc cggcuuacgg 19 8 0 
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gguccgguca cgcagaugua cucgagugcu 
ccugggacca agucuuugga gccgugcaag 
cggaacgcug augucauccc ggcucggaga 
ccgagaccca uuucgaccuu gaaggggucc 
cacgucguug ggcucuuccg agcagcugug 
uucauccccg uugagacacu cgacguuguu 
acgccaccgg cugugcccca gaccuaucag 
ggaaagagca ccaagguccc ugucgcguau 
aaccccucgg uagcugccac ccugggguuu 
aaucccaaca uuaggacugg agucaggacc 
acauauggca aauuucucgc cgaugggggc 
ugcgaugaau gccacgcugu ggaugcuacc 
caagcagaga cagccggggu cagacuaacu 
gugacaaccc cccaucccga uauagaagag 
uucuauggga gggcgauucc ccuauccugc 
cacucaaaga aaaaguguga cgagcucgcg 
guggcauacu auagaggguu ggacgucucc 
gucgccaccg acgcccucau gacgggguac 

cagacugucc cacaagacgc ugucucacgc 
agacagggca cuuauaggua uguuuccacu 
guagugcuuu gugagugcua cgacgcaggg 
accaccguca ggcuuagagc guauuucaac 
cuugaauuuu gggaggcagu uuucaccggc 
caaacaaagc aagcggggga gaacuucgcg 
gccagagcca aggccccucc cccguccugg 
aagccuacgc uugcgggccc cacaccucuc 
gucacccuca cacacccugg gacgaaguac 
gucaugacca gcacgugggu ccuagcugga 
cuggcgacug gaugcguuuc caucaucggc 
gcgccggaua aggagguccu guaugaggcu 
gcggcucuoa ucgaagaggg gcagcggaua 



gagggggacu ugguaggcug gcccagcccc 2 04 0 
uguggagccg ucgaccuaua ucuggucacg 210 0 
cgcggggaca agcggggagc auugcucucc 2160 
ucgggggggc cggugcucug cccuaggggc 222 0 
ugcucucggg gcguggccaa auccaucgau 22 8 0 
acaaggucuc ccacuuucag ugacaacagc 2340 
gucggguacu ugcaugcucc aacuggcagu 24 0 0 
gccgcccagg gguacaaagu acuagugcuu 24 60 
ggggcguacc uauccaaggc acauggcauc 252 0 
gugaugaccg gggaggccau cacguacucc 2580 
ugcgcuagcg gcgccuauga caucaucaua 264 0 
uccauucuog gcaucggaac gguccuugau 27 0 0 
gugcuggcua cggccacacc ccccggguca 2760 
guaggccucg ggcgggaggg ugagaucccc 2820 
aucaagggag ggagacaccu gauuuucugc 2880 
gcggcccuuc ggggcauggg cuugaaugcc 294 0 
auaauaccag cucagggaga uguggugguc 3000 
acuggagacu uugacuccgu gaucgacugc 3060 
agccuggacc ccaccuucac uauaaccaca 3120 
agucagcgcc gcgggcgcac agguagagga 3180 
ggugaacgag ccucaggaau guuugacagu 3 24 0 
gcugcguggu acgaucucac accagcggag 3300 
acgcccggcc uacccgugug ucaagaccau 3360 
cucacacaca uagacgccca cuuccucucc 3420 
uaccuaguag ccuaccaagc uacggugugc 3480 
gacgccaugu ggaagugccu ggcccgacuc 3540 
cuguaccguu ugggcccuau uaccaaugag 3 600 
aucgccacau gcaugcaagc ugaccuugag 3 660 
ggaguccugg cagccgucgc cgcauauugc 3 720 
cgcuugcacg ucaaccagcg agucgucguu 3 780 
uuugaugaga uggaggaaug cgccucuagg 3 84 0 
gccgagaugu ugaaguccaa gauccaaggc 3900 
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uugcugcagc aggccucuaa 
cccaaagugg aacaauuuug 
cucgcaggau ugucaacacu 
gccgcccuca ccaguccguu 
ugguuagcgu cccagaucgc 
gugggggcug ccgugggcag 
uauggugcgg gcauuucggg 
ucuauggaag augucaucaa 
ggggucaucu gcgcggccau 
uggaugaaca ggcuuauugc 
gugacggagu cggaugcguc 
agccuacuca gaagacucca 
uccuggcucc gcgacgugug 
cugaccucua aauuguuccc 
uacaagggug ugugggccgg 
aucucuggca auguccgccu 
accuggcagg ggaccuuucc 
cccacgaacu acaagaccgc 
cagcaugggu cguacuccua 
caacuaccuu cuccagaguu 
cccacaccaa agccguuuuu 
gcugucgggu cccagcuucc 
cuaacagauc cgccccacau 
ccuccaucug aggcgagcuc 
ugcaccaccc acagcaacac 
gg c gg u g u 93 cucagacaga 
auggccgagg aagagagcga 
agcgggvuruc cacgggccuu 
gaaucgugga ggaggccaga 
cccaagaagg ccccgacgcc 

ggugaugcag gcucguccac 



gcaggcccag gacauacaac 
ggccagacac auguggaacu 
gccagggaac cccgcggugg 
gucgaccagu accaccaucc 
accacccgcg ggggccaccg 
cauaggccug gguaaggugc 
ggcccucguc gcauucaaga 
ucuacugccu gggauccugu 
ucugcgccgc cacgugggac 
ouuugcuucc agaggaaacc 
gcagcgugug acccaacuac 
caauuggaua acugaggacu 
ggacuggguu ugoaccaucu 
caagcugccc ggccuccccu 
cacuggcauc augaccacgc 
gggcucuaug aggaucacag 
uaucaauugc uacacggagg 
caucuggagg guggcggccu 
uguaacagga cugaccacug 
uuucuccugg guggacggug 
ccgggaugag gucucguucu 
cugugaaccu gagcccgacg 
cacggcggag acugcggcgc 
cucagugagc cagcuaucag 
cuaugacgug gacauggucg 
gccugagucc agggugcccg 
ccuugagccc ucaauaccau 
accggcuugg gcacggccug 
uuaccaaccg cccaccguug 
ucccccaagg agacgccgga 
gcaacuggcc aucaagaccu 
gggggcgggc gccgccgaau 
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ccgcuaugca ggcuucaugg 3 960 
ucauuagcgg cauccaauac 4 020 
cuuccaugau ggcauucagu 4080 
uucucaacau caugggaggc 4140 
gcuuugucgu caguggccug 4 2 0 0 
ugguggacau ccuggcagga 4260 
ucaugucugg cgagaagccc 432 0 
cuccgggagc ccugguggug 4380 
cgggggaggg cgcgguccaa 444 0 
acgucgcccc uacucacuac 4500 
uuggcucucu uacuauaacc 4560 
gccccauccc augcuccgga 4520 
ugacagacuu caaaaauugg 4 58 0 
ucaucucuug ucaaaagggg 4740 
gcugcccuug cggcgccaac 4800 
ggccuaaaac cugcaugaac 4860 
gccagugcgc gccgaaaccc 4 92 0 
cggaguacgc ggaggugacg 4980 
acaaucugaa aauuccuugc 5 04 0 
ugcagaucca uagguuugca 5100 
gcguugggcu uaauuccuau 5160 
cagacguauu gagguccaug 5220 
ggcgcuuggc acggggauca 5280 
caccgucgcu gcgggccacc 5340 
augccaaccu gcucauggag 54 0 0 
uucuggacuu ucucgagcca 54 60 
cggagugcau gcuccccagg 5520 
acuacaaccc gccgcucgug 5580 
cugguugugc ucuccccccc 5640 
cagugggucu gagcgagagc 5700 
uuggccagcc cccoucgagc 5760 
ccggcggucc gacguccccu 5820 
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ggugagccgg cccccucaga gacagguucc gccuccucua ugcccccccu cgagggggag 5880 
ccuggagauc cggaccugga gucugaucag guagagcuuc aaccuccccc ccaggggggg 5940 
gggguagcuc ccgguucggg cucggggucu uggucuacuu gcuccgagga ggacgauacc 6000 
accgugugcu gcuccauguc auacuocugg accggggcuc uaauaacucc cuguagcccc 6050 
gaagaggaaa aguugccaau caacccuuug aguaacucgc uguugcgaua ccauaacaag 6120 
guguacugua caacaucaaa gagcgccuca cagagggcua aaaagguaac uuuugacagg 618 0 
acgcaagugc ucgacgccca uuaugaouca gucuuaaagg acaucaagcu agcggcuucc 624 0 
aaggucagcg caaggcuccu caccuuggag gaggcgugcc aguugacucc accccauucu 63 00 
gcaagaucca aguauggauu cggggccaag gagguccgca gcuuguccgg gagggccguu 63 60 
aaccacauca aguccgugug gaaggaccuc cuggaagacc cacaaacacc aauucccaca 6420 
accaucaugg ccaaaaauga gguguucugc guggaccccg ccaagggggg uaagaaacca 6480 
gcucgccuca ucguuuaccc ugaccucggc guccgggucu gcgagaaaau ggcccucuau 6540 
gacauuacac aaaagcuucc ucaggcggua augggagcuu ccuauggcuu ccaguacucc 6600 
ccugcccaac ggguggagua ucucuugaaa gcaugggcgg aaaagaagga ccccaugggu 6660 
uuuucguaug auacccgaug cuucgacuca accgucacug agagagacau caggaccgag 6720 
gaguccauau accaggccug cucccugccc gaggaggccc gcacugccau acacucgcug 6780 
acugagagac uuuacguagg agggcccaug uucaacagca agggucaaac cugcgguuac 6840 
agacguugcc gcgccagcgg ggugcuaacc acuagcaugg guaacaccau cacaugcuau 6900 
gugaaagccc uagcggccug caaggcugcg gggauaguug cgcccacaau cucagaaagc 6960 
caggggacug aggaggacga gcggaaccug agagccuuca cggaggccau gaccagguac 7020 
ucugccccuc cuggugaucc ccccagaccg gaauaxigacc uggagcuaau aacauccugu 7080 
uccucaaaug ugucuguggc guugggcccg cggggccgcc gcagauacua ccugaccaga 714 0 
gacccaacca cuccacucgc ccgggcugcc ugggaaacag uuagacacuc cccuaucaau 72 00 
ucauggcugg gaaacaucau ccaguaugcu ccaaccauau ggguucgcau gguccuaaug 7260 
acacacuucu ucuccauucu caugguccaa gacacccugg accagaaccu caacuuugag 7320 
auguauggau caguauacuc cgugaauccu uuggaccuuc cagccauaau ugagagguua 7380 
cacgggcuug acgccuuuuc uaugcacaca uacucucacc acgaacugac gcggguggcu 7440 
ucagcccuca gaaaacuugg ggcgccaccc cucagggugu ggaagagucg ggcucgcgca 7500 
gucagggcgu cccucaucuc ccguggaggg aaagcggccg uuugcggccg auaucucuuc 7560 
aauugggcgg ugaagaccaa gcucaaacuc acuccauugc cggaggcgcg ccuacuggac 7 62 0 
uuauccaguu gguucaccgu cggcgccggc gggggcgaca uuuuucacag cgugucgcgc 7680 
gcccgacccc gcucauuacu cuucggccua cuccuacuuu ucguaggggu aggccucuuc 7740 
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cuacuccccg cucgguagag cggcacacac 

uuuuuuuuuu uuuuuuuuuu uuuuuuuuuu 

uuucuucccu ucucaucuua uucuacuuuc 

acggcuagcu gugaaagguc cgugagccgc 

ucugcagauc augu 



uagguacacu ccauagcuaa cuguuccuuu 7800 
uuuuuuuuuu cuuuuuuuuu uuuuucccuc 78 6 0 
uuucuuggug gcuccaucuu agcccuaguc 7920 
augacugcag agagugccgu aacuggucuc 7 9 80 
7994 



<210> 9 

<211> 340 

<212> RNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: synthetic RNA 



<400> 9 

accugccccu aauaggggcg 

cuucacgcag aaagcgccua 

cccccucccg ggagagccau 

aagacugggu ccuuucuugg 

caagacugcu agccgaguag 

cgcuugcgag ugccccggga 



acacuccgcc augaaucacu 
gccauggcgu uaguaugagu 
aguggucugc ggaaccggug 
auaaacccac ucuaugcccg 
cguuggguug cgaaaggccu 
ggucucguag accgugcacc 



ccccugugag gaacuacugu 60 
gucguacagc cuccaggccc 120 
aguacaccgg aauugccggg 180 
gccauuuggg cgugcccccg 240 
ugugguacug ccugauaggg 300 
340 



<212> RNA 
<213> Artifi' 



<220> 

<223> Description of Artificial Sequence: synthetic RNA 
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<400> 10 

acccgccccu aauaggggcg acacuccgcc augaaucacu ccccugugag gaacuacugu 60 

cuucacgcag aaagcgucua gccauggcgu uaguaugagu gucguacagc cuccaggccc 120 

cccccucccg ggagagccau aguggucugc ggaaccggug aguacaccgg aauugccggg 180 

aagacugggu ccuuucuugg auaaacccac ucuaugcccg gccauuuggg cgugcccccg 24 0 

caagacugcu agccgaguag cguuggguug cgaaaggccu ugugguacug ccugauaggg 300 

ugcuugcgag ugccccggga ggucucguag accgugcacc 340 



<210> 11 
<211> 236 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic RNA 
<400> 11 

agcggcacac acuagguaca cuccauagcu aacuguuccu uuuuuuuuuu uuuuuuuuuu 6 0 
uuuuuuuuuu uuuuuuuuuu uucuuuuuuu uuuuuuuccc ucuuucuucc cuucucaucu 12 0 
uauucuacuu ucuuucuugg uggcuccauc uuagcccuag ucacggcuag cugugaaagg 180 
uccgugagcc gcaugacugc agagagugcc guaacugguc ucucugcaga ucaugu 236 



<210> 12 

<211> 232 

<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence: synthetic RNA 
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<400> 12 

agcggcacac auuagcuaca cuccauagcu aacuguuccu uuuuuuuuuu \ 
uuuuuuuuuu uuuuuuucuu uuuuuuuuuu uuucccucuu ucuucccuuc ucaucuuauu 12 0 
cuacuuucuu ucuugguggc uccaucuuag cccuggucac ggcuagcugu gaaagguccg 18 0 
ugagccgcau gacugcagag agugccguaa cuggucucuc ugcagaucau gu 2 32 



<210> 13 

<211> 17 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 

<400> 13 

cgggagagcc atagtgg 



<210> 14 

<211> 19 

<212> DNA 

<213> Artificial Sequence 



223> Description of Artificial Sequence: synthetic DNA 



agtaccacaa ggcctttcg 
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<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 

<400> 15 

ctgcggaacc ggtgagtaca c 

<210> 16 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 16 

aacaagatgg attgcacgca 



<210> 17 
<211> 20 



13> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: synthetic DNA 



<400> 17 

cgtcaagaag gcgatagaag 
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<210> 18 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 18 

gcactctctg cagtcatgcg gctcacggac 



<210> 19 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 

<400> 19 

cccctgtgag gaactactgt cttcacgc 



<210> 20 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequence: synthetic DNA 
<400> 20 

ccgggagagc catagtggtc tgcg 24 



<210> 21 

<211> 30 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 21 

ccactcaaag aaaaagtgtg acgagctcgc 



<210> 22 

<211> 18 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: synthetic DNA 



ggcttgggca cggcctga 



<210> 23 
<211> 30 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 23 

gcggtgaaga ccaagctcaa actcactcca 30 

<210> 24 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 24 

agaacctgcg tgcaatccat c 21 

<210> 25 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 25 

cccgtcatga gggcgtcggt ggc 23 
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<210> 26 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 

<400> 26 

accagcaacg gtgggcggtt ggtaatc 



<210> 27 

<211> 18 

<212> DNA 

<213> Artificial Sequence 



<223> Description of Artificial Sequence: synthetic DNA 



<400> 27 

ggcacgcgac acgctgtg 



<210> 28 

<211> 30 

<212> DNA 

<213> Artificial Sequence 

<220> 



160 



JP Application No. 2003-329115 



<223> Deecription of Artificial Sequence: synthetic DNA 
<400> 28 

agctagccgt gactagggct aagatggagc 



<210> 29 

<211> 20 

<212> DHA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : synthetic 
DNA(primer) 

<400> 29 

aacaagatgg attgcacgca 20 



<210> 30 
<2ll> 20 

<2 12> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : synthetic 
DNA(primer) 

<400> 30 

cgtcaagaag gcgatagaag 20 
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<210> 31 
<21b 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : synthetic DNA 
<400> 31 

gcactctctg cagtcatgcg gctcacggac 30 

<210> 32 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : synthetic DNA 

<400> 32 

cccctgtgag gaactactgt cttcacgc 28 

<210> 33 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :: synthetic DNA 
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<400> 33 

ccgggagagc catagtggtc tgcg 



<210> 34 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :: synthetic DNA 

<400> 34 

ccactcaaag aaaaagtgtg acgagctcgc 



<210> 3S 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : synthetic 
DNA (primer) 

<400> 35 

ggcttgggca cggcctga 18 



<210> 36 
<211> 30 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :: synthetic DNA 
<400> 36 

gcggtgaaga ccaagctcaa actcactcca 30 

<210> 37 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :: synthetic DNA 
<400> 37 

agaacctgcg tgcaatccat c 21 

<210> 38 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :: synthetic DNA 
<400> 38 

cccgtcatga gggcgtcggt ggc 23 
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<210> 39 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :: synthetic DNA 

<400> 39 

accagcaacg gtgggcggtt ggtaatc 27 

<210> 40 

<211> 18 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence :: synthetic DNA 

<400> 40 

ggaacgcgac acgctgtg 18 

<210> 41 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :: synthetic DNA 
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<400> 41 

agctagccgt gactagggct aagatggagc 
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[Figure 2A] 



10 20 30 40 SO 60 

aceuGcccea asuaggggcg achcoocgcc augaauc&cw cccftuc-crt&G o&acdacuso 

70 80 90 MO 110 120 

CUU&iGi3CftG AAfiGCCCCUfi GGCHWGOGU Iffi<3y*USAW CUCGOACftGC COCXSGOCCC 

130 140 150 160 ■ no ieo • 

cccca;ccce <5&iSaee6M) aguggucosc (w^c<x:-gi« wsuACACcse Muoecoeee 

190 200 210 320 330 2iQ 

aag&cugsgo cououcwjss koi&ktxckc mnmcccQ ocmioma coyscceoco 

?,50 560 ! 370 280 290 300 

CA&GACUSCU WSOOMGTOG OQOUGGGUUG OGAASjGGCCU W3UGGUACU$ CCUfiAtMQGQ ' 

3J0 320 . 330 340 350 360 

CGC0UGO3AG W3CCCXGGSA 0GUCOO30AQ ACCGOCCnCC AUGAGCA<3» AUCCOAAACC 

370 380 350 408 410 430 

QCMASAttft AOCAAAAOAA ACACCAAOCQ UCGCCC&AU3 AOtKJWiCWvG AUGGMJDGCA 

430 440 450 4«0 470 480 

C6CW5GWCU CCGGCOGdW GGGOS3AGAG GCUAUtfCGGC UAUSACUGGG CACAACASAC 

490 500 510 52* 530. 540 

AOTCQCjeofcC UCOQAUGCOS COSUSyOCOS GCUGOOvGCG CAGGGGOGCC CGGODOWUP 

S50 5S0 570 . 580 590 600 

W3UCAAJ3ACC SACCOGUCCG GSGCCCOSAA UGAACV3CAG GAO3AQ6CA0 OSCXJSCSJfttJC 

*i0 630 630 4l50 iW 

Gusecoeece aosaogggog wooroeosc ascoeoscoc gacguugoca cugaagosgq 

570 §80 630 700 710 7S0 

AAGGGfiCUGG CWGCUAU»G6 GGGAAGOGCC ©33SCAGCAP" COCCUGOCAU COCACCUOSC 

730 740 750 750 770 780 

ucttJGCee*) Mimmccz ucAUGGcvaA yecasoGCGe csecajGCftW oswugavcc 

790 800 010 . «30 830 840 

OQCUACOKC. CCOTUCmCC ACe&JWK* 5 ** ACftCOSGiVUC GAC-OGAGCAC OTACUOGGiVi} 

850 860 $70 380 8P0 900 

wmkkggo cm$uc(mc temmuau ©sacgaagm caocAGOSSC ucsogccagc 

910 920 930 940 930 960 

coAftcusuuc gccaggcuca agqobcgcau Gcccs&CGse gagqaucwki ucougacoca 

570 980 9SO iflOO 1010 ' 1020 

USOKJGCCfift AUAUCAUGKJU GSAAAWGGC CGORJUUCDG GWUCAU0GA 

1030 1040 1050 1C60 1070 100& 

C0G0GGCO3Q CUGCGWOGG CGCACO3C0A UCAGGACAUA GCSUIKgCUA CCSGOGftUAD 

10S0 1100 J.U0 US0 W30 1149 

xmoomm cwjggoggog aausqgcuga (xgcouco* GuayutiAce cuauogccgc 

115S 1160 1170 1180 U90 ' 1200 

ucsxseAwcG cagogcaoco ceooeuAoes ccuycwJGAc gasuocuucu gaguvjuaaac 
1210 1220 . mo 1240 1250 1^60 

ccucucccoc eccccecccy aaogudacch ecosAaeoce cuogqaauaa ggccggvoxs 

1370 laW 1390 1300 1310 1320 

ceyuuGueuft uAtsGcaAuuu uocaccauau u&xmucudu ugscmusug awsoccsoa 

1330 134& 1350 1360 1??0 »80 

AAcoueeccc uawmcuw ivmmimo cu&GGseucv- uuecccucuc sccaasiggaa 
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[Figure 2B] 

U90 1400 HIO 1«0 ' MSO 1440 

uGcassGucu wmmmc qyGtaGGMa caguuccqco <&Kh<scmxx> Whwimm 

im um uio urn iwo isoo 
CAaaeuewsu ac^qacccou ysoAcocftcc <m&cocec£ acsu^os&C aggiKsccucu 



1S7S 1S80 1510 «20 

OO&WAGUDG GAOaSOOWS GSA^GDCft JttUGCHSKttC CGCfcAGOGOA yOCM,C«i6<3 



1*10 1820 1830 £640 1850 18S0 

CMAC&CGAG OOCyCCDGGG <3GCCMJSGOQ OtJGAQDSlWA CGGGGOGUGft CAGG&CAQAA 



asso aeoo 3«o 2«& ssso suq 

WailMKJJCA ATOUUCUCJGC C6AUC-CKXKC OSOQCUWXM <3CG€CUAtX3ft ONOiUCKft 
2660 2670 tm 2690 2"?0O 

(jdccau&mi gccjuxscugo mtmmm ^uycuco gcwoggmc ggbccuubao 

2710 m& 2730 2740 2750 27S0 

CM&CMMft wsoofioeoo.ciwacwMKM GascuGGowi 'o&sccACftcc cccceiseua 
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[Figure 2C] 

3??0 2780 27S0 2800 ' ' 2810 2520 

gwacaacoo ctxmxooh mvMMtoG GUABGocyee GGCw^Coe ugagauoccg 

2830 3840 2350 3860 2«7Q 2880 

wcuausgsa. ggscgabucc ccuftuCeuse aucaagc&ag.gcagacaocu gmrp,joco<3c 

2830 3900 2310 3920 2930 3940 

CfcCUCAAAaA AAAAGQGUGA OSAGtUOGCG GCGOCOftWC GGGGe&SGGfl C^GAAOGCC 

3P50 2970 WW 2990 3 Odd 

goggcmjacsj Atfw&GGGoy g^cgucocc auaauaccag o?cwggw?a wspgqugguc 

3010 3020 3030 3040 3050 3060 

GtJceetKco3 acgcccocasj gacgggguac AcuGGAGAttr wsACac&v gau<$aco$c 

3070 3060 3090 3100 3130 3120 ' 

jAAUfiGAGOSS IK'ACCCAAGC UGW33ACUUC AGOCyGGAOC CCACCOUCAC WUAACCACA 

3130 3140 3150 3160 $170 3180 

CAGACOStKC CACAASACGe UGUCUCACGC K3UCASCGCC GCGGGCGCAC AGGUAGAGGA 

31$0 3200 3210 3220 3230 3240 

AGACAG3GCA OFJAUASGUA U3UUUCCACU GGUGAACGAG CCUCAGGAAU GUUUGACAGtf 

33SD 3260 3270 3380 3S90 3300 

GUSAGWCOA CtiACGCAGGG GOKsOGDGGy ACGAtfCOCaC ACCAGCGGAG 

331Q 3330 3330 3340 3350 3360 

a«yvcewx» ogcwjwsaw wwoeaaq acgcccggcc uacccgocug ucaagaccmj 
3370 3380 3390 3400 34*0 3420 

cuueAAUuw &mmcmo ooocaccsso cucacacaca uagaosqcxa cwccucuec 

3S30 . 3440 3450 34S0 3470 3480 

(AsACAfcfiGc AAeoGGseea oa&cuwjsco vaocuasum ocwwawjc vhOxwooc 

3450 3500 3510 3 320 353 0 3540 

GOCAGAGCCA A<5SC<:CCy0C OOSUCCUGG GAOSCGAtKB .<&MGCBCCU GGGCOQAOfC 

3530 3560 3570 3980 3550 3600 

MGCOJACGC UOeCGGGCOC CftCAfXtJCOa COGOACCGW yOOSCCWAU (JACCMU6AS 

3610' 3820 3630 3S40 ' 3650 3660 

GOCAOCOJCA CACACCOKG GAOGAAGUAC JVUCGCC&CMJ GCADGCAAGC UGACOJUGAG 

367Q 3660 3690 3700 3710 3720 

Gtx&oGAceft acAosoeeoy ccD&QcoGQA ggaguccugg aiGccGyesc cgcatootsc 

3730 3740 3750 3760 3770 3760 

CUGSCGACUG GAUGOSPOUC CAWayOSGC CGCUWSCACO UCAACCAGOS agucgucsjcju 

37S0 3600 3810 3820 3830 3840 

GOGOCGSAUA AGSAGGUOCD GUAOGAGGCO UOUQAUGAGA UGGACOAAOG CGCCdCUAGG 

3850 3560 3870 3880 3890 3900 

GOSGCVCwCA OCGAAGAGGG GCAGCGGAUA GCCQVJAWU PGAAOOCCAA GAtJCCAACGC 

3S10- 3930 3930 3940 3950 3S60 

OOCCDGCAGC AGGCCUCUAA 6CAGGCCCSG GACAUACMC CCGCOAUGCA GGCWC'AUGG 

3970 3980 3«90 dOOO 4010 4620 

ccchhmm mcaauwjos ggccagacac auguggmco dcauoagogs csuccaawc 

4030 4040 4050 4060 4070 . 4CB0 

CUCeCfffiSfeU TOWCfiACACU GCGAGGGASC tfCOSCGGOGG (WCOMGW OGCAUKSSD 

-iOSO 410& 4110 4120 4130 4140 

GCCGCCCDCA CaVCOCOGUO SyOGACCAGX) ACCACCAUCC UiKUCAACMJ CMJGGGAGSC 
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[Figure 2D] 

am 4its& 4i?o «eo 4190 4200 
w&ywoOGU cccagabccsc accacx#g<& g^ssccacos gojdbguogu caodggccocs 

4310 4220 4230 4240 4250 4360 

gusgggscus cogissgggag- caoaggccug GQvm&vso mmozew .c<3jsgcagga 

4270 4280 4290 4300 '4310 4320 

vimtmosQ gcauuucggg ggoocogsuc gcawcaaga pcftoowcusg <#agaagccc 

4330 4340 4350 4360 4370 4380 

' OOJflWQAJiG AUKJCatJCftA UCUACUSGCU GGGAOCCUGD CtKXGGGAGC <XOBSUGGUG 

«S0 4400 4410 4420 ■ 4430 4440 

GGGGtXatfOT GGGGGGCCAtf UCUGCGCCGC CA03UGGGAC CGGGGGAGG6 CGCGGUCCAA 

44S0 I4«0 4470 4480 4430 . 4500 

UGGAO0AACA GGGUOAUOGC COUiXiCTOCC AGA6GAAACC ACGGCGCCCC OACOCACUAC 

4(53.0 4530 4530 4540 45SO 4560 

&mcmm cgsaggcguc gcagcgogug acccaacoac ot)GGcocGcw uagbaomcc 

4570 9S0O 4590 4600 4610 . 4620 

AsecyAcucA gaagacocca caawjggaua aobaggacu gkkcauccg abgoiccgga 

4630 4640 4650 4660 4670 4600 

UCCOGGCUCC GOJACGUGUG GGAjHjeGGOtf OSCACCAtJCO BGACAGAOA7 CAAAAAHSGG 

46S0 4700 4710 4730 4730 4740 

CWAOCOCCft AAWGUOCCC CAAGCOGCCC GGCCTOCCCW DCRUCOCWJB OeAAAASGGG 



UACAAGGGUG WGOGGCCCGS CACOeQiCMX: AUGACCACGC GafcCeCOQG CC-GCGCC&AC 
4810 4820 4630 4840 4050 4860 

aacocoGGCA AD30CO3CCO GGeeucoAtxj Asexocwae osccdaamc CM3CAOG6AC 

4870 4BB0 4890 4900 . 4910 4920 

AO0GGGCAGG GGACCWWC TOOCAAUQGC ^CAOGGAGO GCCAGUGCGC GCCGAftAOCC 

4930 4940 49S0 4960 4970 4900 

COCSOClAACtf ACAAGAO0GC CAUCOGOAGG GUGGCGGCCO CGGaGUACGC GGAGGUGJsOS 

4990 5000 5010 6030 5030 5040 

CAGCAUGQGO CGGACPCCUA (JGOAACAGGA CUGACCAOX3 ACAAUCUSAA MWCCOTGC 

5050 5060 5C73 5080 6090 S100 

CAAcuACcuo cuocagaguu uuocyccuGG goggacggus ogcagaocca uaggbougca 

5110 S12Q 5430 5W0 5ISO 516D 

CmCftCCAA ASCGSUWUO CCGGGAUGAG GUCEJCGttOCtf GCGUUGGGCU UAAUOCCVAO 

S170 5180 5190 5300 3210 §32Q 

C-CUGUCGGSO 0CCAGCWCG OWSAACCS) GAGCCCGACG CAGAOGUAWtJ GAGGUCCAUG 

$J30 £240 525Q 5S60 5370 5280 

CtfAACAGAOC OGCGCCA^U GAQSGCGGAG ACOGCQSCGC GGCGCUOGGC ACGGGGAgCA 

5290 5300 ■ 5310 5330 5330 5340 

COJ0CAUC0G AGGCGAGO0C CUCAGUGAGC CAGCUATJCAG CACCGUCGCO GCGGGCCACC 

5350 5360 5370 5380 5330 5400 

UGCACCACCC ACAGCAKCAC COAtSQAOGDQ GACAUGGtfCG AUgCCAACCU <JCUCfiUSGAS 

S410 5420 5430 5*40 5450 5460 

GGCGGUGOGG CUCAGACAGA GCCUSAGtTCC AGGGUGCC0G U0CO6SACUU IKUCGAGCCA 

3470 3480 5490 5500 5510 5520 

MJGQCQBftGG AJVGftGAGOSA CCUTOAqCCC tXSWttCCMT CGGAGUGCMJ GCUCCCCAGG 
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[Figure 2E] 
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AGCGGGoroe cmmccm Accqbwqss gcacggccus acuacaaocc gcogcucguq 
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^(iUJOSllGSA GGiffiGCCfiQA WRCCS&CCS CCCACCGUUC! CUGQiJUGOGC OCUOCCbGOC 
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OCG&ftaAASQ CCCCGACGCC UCCCCCAAGG AGACGCCGGA CflGUGGGSJCU GftGOGAGRGC 
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Acasnocftd mgcccocca gcaacoogcc aucaagacci) otggccagcc ccccucgagc 

5770 5780 5790 5800 . 5810 ' 5820 

. GCUCQOOCAC CXK5GGCGGGC <3C03CCGAA'J CCQCCCGUeC GAOGUCCCCU 

5830 58*0 5e50 5860 5870 5850 

(sjyGAsecsa cccccucaoa gacagguucc occuccucm uccccccocu cGA&asseAG 

5890 5900 5910 5920 5930 3940 

COKMAGAUC OCKACCUGGA GUCUOAtfCAG QUAGAWTOC AACCUCCCCC CCAGGGGSGG 

. 5550 5960 5&70 5380 5990 6000 

(maumcoc cogquocggg cuoGeequcu uqcucuACutr ^xxxgagoa ggacgauacc 

0010 6020 iOJO 60-10 6050 6060 

ACCGUGUGCU GOTCCWKUC AUACUOCOGG ACCSGGGCUG BAAVAACPCC (S^UflfiCCCC; 

GMQ 6080 5050 €100 6110 6120 

OtofceMSSWl AGW5GCCAAU CAAOCCUUOO AiSUAAOUCGC Q3UIX3C9AVA COWJMCWQ 

' 6130 «40 6150 «160 6170 61S0 

GtSGtJACOGUA C&ACAUCAAA GftGOSOWCR <a<3A66GClM SAAAGGOMC WOTS&C&se 

6190 6300 6310 6220 6230 6240 

ACGCAASDSC U03ACOCCCA. U0RW3AOTC& GOOTOAAAOO ACSUCAAOCU ASCCOCOOCO 

6250 6360 6270 6280 $390 6300 

CAAWCOCOJ C&CCOTIGGAG OAGGCGCJGCC ACUuaftCUCC ACCCCAUUCU 

6310 6320 6330 6350 6360 

(3CAASAUCCA TiGOM&hW OGG&3CCAAO GAGGUCCGCA GCUUGCJCOG0 GAGGGCGGD0 

6370 6380 6390 6400 6il0 6420 

AACCACAUCA AGWXSUGUG (3AA6SA0CUC CUGGAAGACC i&CAAACACC AAWSCCACA 

<S430 6440 6450 6460 6470 . 6480 

ACCAUCAUH6 CCAAAAAUGA tWIKOTJCWC eSe&ACcccG GCAA6GGGGG UAASAAACCA 

6490 6S0D 5510 653D -. 6530 6540 

ggocgocuca ucgwuaccc ugaccucsgc eyccesoocu ecoAOAAMu ewcucgau 

6550 6560 6570 6580 6S90 6600 

GfcCAtMACAC ftAMGCOUOC UCASGCGGPA RWOSA^fCOO CCOAUGGOKJ CC8Gt?flCUQC 

5610 6620 5630 664* 6650 6660 

CCGSCCCAfcC GffSUGQACJUA UCOCOUGAAA GCMJSGGCGS AAMGAAGGA CCOCAUSiSey 

6670 6660 5690 6700 «710 57X0 

WOyCWAUS AUACCCGAUG CWKXJACyCA AOQCJUCACUO AGftGAGACAU C&GSACC6A3 

6730 6140 675G $7*0 S^O 6790 

GMftKCMflW JKJCWWCCTO aiOCCOGOCC GMeWSJCep KBCWCCW ACAC0CGCUG 

6790 §800 68iG 6830 ' 6840 

K&zhQMiw mmsom agggcccaug uucaacagca aggqucamc cugcgguuac 

6850 6860 S870 6990 6890. 000 

GCGOCAGCGS GSUGCUAAOC ACaAGCAGGG GUMCAOCAU OK&UeCUSg 
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[Figure 2F] 
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Goameccc: imeoaeecus cwhsgcuscg eoGWJftcisje cecsxAcmu ecoseyatKC 
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AGAGCOJUCA OSSAGSOPAO GftCCAGGUAC ycUQCCCCOC WSGUGAUCC •CC0CAG&SO3 

7050 nOO 7JiO 7120 7130 7i« 

GMUAOfcACC OSSAGCUAAQ AAOWOOKSD OCCOCAAAUG yGyCOSDGSC GW5GSGCCC6 

3150 ?l€0 7170 71S0 7190 7300 

CSSOOmCC GCAGAUACUA CCOSACCAGA GACCCAACCA CVCCACUCQC CCGOSCOOCC 

7210 7320 7230 7240 7250 7360 

TK^AWCWJ UUAGACAOJC CCCUAOCAAU 13CAOGGCOGQ GAAACAUCAU CCftGUAUGCU 

7270 7280 7290 7300 7310 7320 

CCAACCWAV <KGl3O0GCAV GGOCCUAAUG ACACACWCU . UCS3CCAUUCU CA060UCCAA 

7330 7340 73S0 7360 7370 7380 

GACACOCGGG AOCAS&ACCU CAACUUUGAO AUS0AW5GAU CAOTAUACUC CGW3AAUOCU 

7390 74QO 7410 7420 7430 7440 

WKBACOUUC CAfSCCAPAW UGAGASSUUA CAO3G0CUW3 AOCSCCUUOUC UADGCACACA 

7450 7460 7470 7460 74S0 7500 

UACUCOCACC AO3AAC0QAC G03GGIKK5CU OCAGCCCUCA GAAAACUOGS <*OKXSCOC 

7310 7520 7-630 ?S*0 7SE0 7550 

OJCAGSSBSO GGAAGAGOCQ G6CUCGCGCA GUCRGGGOGU CCCUCAOOOC CCSWJOAGGG 

7570 7SSQ 7590 7KQQ ' 7610 -763Q 

AAAOO30CC6 OOOGOGGCOa AUAU&KOTC AAWGSSOG0 U6AAGAOC&& GOJCAAAOX3 

7630 7640 7650 7660 7670 7680 

AOXXXAUOGC asffi«eCGC5Q COJACOGGAC UOMKXSQDU SGUUCAC0G0 CGGCGCtKGG 

76?0 7700 7710 7720 7?30 7740 

(^C-GGCGACA VWUUCACAG OSU00060<X CCOCGACCCG 6CUCAWACU C00O33CC0A 

. 7750 7760 7770 7780 7730 780O ■ 

cuccuAoroo ocsyASOGGo aggccocuuc aiAcucccos - cucggwsa© cgscacacac 

7810 7830 7830 7840 7050 7860 . 

wgguhcacp ccauagcuaa ctxjwecuou tnnroyuuuuo oouuuuouou uuouutswuo 

7B79 7680 78SO 7&00 7910 7S20 

uutnjouuuuu araoouuuuu vuouueceoc wwcuocccu ucocaocwia uucuAcwwe 

7530 7940 -79S0 7960 7970 7S0O 

ooucouggog carccAUcuo agcccoagoc aoggcoagcu gugaaagooc cguqaocqgc 

W0 8000 6010 8020 8030 8Q3Q 

ATOAOISCAS AO&tiOGCGSti AAC03GUCOC ucoscacauc hVSQ 
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[Figure 3A] 
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cccccyocos ggagagccad aguc^ucugC ggaacoggog aguacaccgg AAOtiGCCGSS 

190 200 210 220 230 ' 240 

AAGAE3JGGW CCOTOCOOGG AOAAACCCAC UCOAVGCCCG GOQftJUtJQGG OGOOCCSCCe 

350 350 270 280 290 300 

CftAOAeuGCU AGCCGAGUAS OSttOGGGWXJ CSAAAGGOCtf OGOGGOAOXJ COXJAWGCG 

310 33Q 330 340 , 350 360 

ugcougogag ogcoccggga ggikooguag accgoocacc augascacaa auccca&acc 

370 360 390 400 410 S20 

UCAAAGAAAA ACC/lARAGAA ACACUAACOQ tKGCCCAAUG AUUGWlCAAG AUGGAUDGCA 

430 440 450 460 470 480 

cgcagghuco ccggocgcusj ggguggagag gcuauooggc WUGACUGGG CACAACAGAC 

500 S1<J S20 . . S30 530 

AABCGGC3JGC VCUGMJGOT3 GOGuGWCCO GCVGOC'AOCO CAGGGGCGCC CGSUUCSUUtJ 

£55 560 S70 560 590 600 

W30CAASAOC GACCUGUCC3 SOGCCCWJAA UGAACPGCSG GACGA&SCAG OGCGGCUAOO 

■610 620 630 640 650 660 

Gffi&OJXCC ACGACGGGOG 013CC0XJSCSC AGCUGU6COC GftCGVVGyCA COOAAGOSSG 

670 6»0 690 706 710 720 

AAGGGACUGG CU3CUAUOGG GCGAAGUGCC GGGGCSGGAU CUCCUWCAU CtfCACCOOGC 

730 7*0 750 760 "370 ?00 

0CCO3CCGAG AAAGUAUCCA OCftUGGCUGA OGCftAOGOGS O3GCUG0AUA CGCWGAUCC 

790 800 810 $20 830 ' 840 

GGCUACCUGC CCAWCSflCC ACCAAGCGfiA ACAUCGCKUO GAGCSAGCaC GfJACJCGGAU 

050 660 870 OSQ - 890 900 

ggaagccggu euosoesjsoc aggmwaocu ggacgaagag caqcaggggc ikscgccagc 

S10 920 930 940 950 9S0 

CGAAC^OaC GCCaGGCDCA AGGGGCGCSU GCCCGAOGGC .GAGGAOCUCG UCSOGACCCA 

970 980 990 1000 WW 1030 

IXSGCGAUGCC USCfflJGCCGA AUAOCAW5GU BGAAAAtSSGC CGe»W)yCO& GAUOCAUOGA 

1030 1040 1050 1060 1070 1080 

cyGOGOCQGS COGGGtfGlSSG O3GACCGC0A WAGGACaOA S03WGGCVA COCwGAuAU 

1090 11O0 1110 1120 1130 1140 

UGCOOAASAffl CtXJGGCGGCG AMX3GGCOGA <X£OTUeC0C GqGOJOUACG G0AXJCGCO3C 

1150 1160 1170 1180 1190 JZOO 

OOCOGAOUOG CAGCGCAUCG CCWCXISWCG CCOUCWGAC GAGUDCWCU GftGOWAAAC 

mo 1330 1240 1250 1260 

CXSJCUCOCW CCCCCCCCCO MCGOUACOS GCCGAAGC03 COOGGAAUMi GGCCGGOGUG 

. m& 1260 1390 1300 1310 ' 1320 

OSUOTGDCUa UAUGyUAUWJ UCC&CCAUfiU OGCCGUCTJUtf DGeC&AGG&S AGQGOCCMfc 

1330 ' ma 3350 1360 J3^0 1380 

AAceiiGGCcc dgsjcuucuug Aoe&ecAuuc cuags&dcu uuccccwcuc gccaaaggaa 
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[Figure 3B] 



1390 1406 1430 1420 1430 1440 

wa&x&vco GOUGAAy&iq otioaAGGAAG cagougcocu ©gaagcquco OGMGAeAAA 

1450 1460 X470 1480 1490 1500 

cAAOsacoGu Mc&hoxm «6caGa*ec ggmsxcccc Accuscicafec aggugccucu 

VSW 1520 3.530 • 15^0 ■ 1550 1550 

gcggccaaaa eccAOG^suA uaagauacac cugcaaaggci gg<;a<&agcc! cAmiccaActs 

iS7ft iseo . isso igoo 1610 U20 
isgugacwx; aftuftcoueys gaa&gaguca aausgcow? cocaagcgua uucaacaagg 

1S30 1640 1650 1650 1670 16*0 

. GGCtWASGGA tf^GAGAAG GtfACCCCAW G0A1*53OAUe UGAUCOGGGG COXXkaJGCA 

1690 1700 1710 1720 1730 17*0 

CKUGCOUUftC AW30GUUUAG tJCGAGGUUAA AAAAACGOCy M&CXCCCm AaCCACGGGG 

17»s0 1760 1770 1760 1790 1*00 

ACG03GWXJU COWUGMAA ACACGWAAU ROCSWOCCC ccavcaccgc WAOGCCCAG 

1820- 1030 . 1840 1850 M60 ■ 

CAGACAQ3&G GUCyCUUGGG CUCUAUAGUS GOGAGCRUGA CGGGGCGtiGA CAAGACAGAA 

«70 1800 taso isoa ioio isao 

CA6GCCGGGG AGGOCCAAGU OSStfCCACA GUCACSKSGtf CCTOCCVCSG MCAUCCAUU 

1930 • 1940 1950 I960 1970 1980 

0OSG6GGOCU UAUOTACWSO OTACCA066A GCOGGCAACA AGACACUASC CGGCaCCCGG 

1990 3Q0O 201Q 3020 2030 Z040 

GGCCWGUCA CGCMJMJGUA CWSA«,X5CC GAGGGSGACU OGGUOGGGOG GCOCAGCCCO 

3060 . 2060 2070 2080 2090 3100 

ccwesACCA MixwpeoA gccguguacg UG03GAGCGQ iksaccugoa uouggocrcg 

2110 2120 3130 2W0 21S0 2i$fl 

C30AACGCGG AU3JCAUGCC GGO50SAA0A CCSOSGGOACa. JS3C66SQAGC CCOSOXiOCC 

a 170 2180 219(1 2200 2210 2220 

COSAG&CCOC UUKGACCQO GAS6GGGWC UCG6GGGG&C CUGUGCWWS CCCWGGGGC 

2230 224£> 2250 2260 2270 2280 

CAOGCUGUCG GAAUCU0CO3 GGCAGCUGWG OaCUCUCGGG GO203GC0AA GEJCCAUAGA0 

KS90 3300 2310 2330 2330 2340 

UUCAUCCCCG OUSAGACGCtf tX3ACAVCGt}C ACOtGGU&JC CCACCOOUAG DGACAACAGC 

5tS50 2360 2370 2310 2390 2400 

ACACCACCAG OXSyGCCCCA GACCOAGCAG GOGGGGUACJ UGCACGCCCC CAOMGCAGU 

2410 2420 2430 2*40 24S0 2460 

ggaaaaagca ccaasGocee cgooscgoac gccgoxagg ggoauaaagu gcugguscuc 

2470 2480 2490 2500 2510 2520 

AAUCOCUCGG UGGCyGCCAO CCOGGGATOO GGGGOGUACW WJGCCAAGGC ACAOGGCAVC 

2530 "254D 2550 3560 2510 ' 2£>80 

AACCCC31ACA UUAGGACUGG AGUCAGAACU GOGACGAC03 GGGAGCOCAt) DACAOACUCC 

2590 2600 2610 2620 2630 2640 

mmmvh amwcchosc. cgausggggc ugcgcaosos gogccuauga caucaocaoa 

S6S0 2ifs0 2C70 3660 • 2690 2700 

xs&asmmts) gccacijcusd oeauocoAcc aojauuotcg gcmcgggac agvccuugac 

2710 2720 2120 2740 2750 2760 

CAAGCASaGA CA6CCG6GG0 ^AGSCOaACU CSOACUGGtCA CCmaCGCC CCCCSGGiJOS 
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[Figure 3C] 



2S00 2910 2S30 2940 

CACUCAAAGA AAAAGU&UGA CGASCIKSCA ACSGOCCOOC GGSSCJSXSee OUUGAftOSCiT 

2$5C 2:970 2980 235ft 3000 

fiUGQOMmUD &C&GAGGGW GGACGUWCC AUAAOACCAA O0CAA0GA6A UGUSGSGGUC 

mO 3020 3030 3040 3050 . 3050 

otKjscCAcee &a5CCOK3\u qacggcguau AcasoABAca iRxSAeueosu oaue©Acoec 

3070 30S0 3030 3X00 3110 3120 

OC&CCCAffiZC CffiJAGAeoiXJ AC<KC*33ACC CCACCVVCAC VAUAACCACA 



3190 3200 3310 3220 3830 3240 

AQACOGGGCft UUUAUASSSUA UGOffiKSCACOi CCUSAQCGAS CCOCAQGAAO GO0UGACAG0 



3310 3320 3330 3340 3350 3360 

AOGACCGUCft (X3COCAGGGC CUAUOUCAfcC ACQCOT5GC0 OSOOQGUaJCi OCAGGRCCAC 

3370 3360 3350 3400 3410 3430 

CUUSAlWDUt! 0G0ASSCW3O WUtACCfiGC CWCACACACSk UMSACGOXA tXXOCCSKStfCC 



35S0 3960 3570 3590 " 3590 3600 

AAWCC^C^.0W«5GGC<K.DA(^3<3^ CUGWCOSUEJ tfSGGCOCUSU UACCAAOSAG 



3730 3740 37S0 3760 3570 3790 

WftGCCAC<?3 GOT30GUWJC CAUCAUUGGC OOTUOACACA UCftACCAGCO AGCOGIK3GUG 

3790 3SO0 3810 3830 3S30 3840 

QCUCQSGACA AGSASGOCCU CUAWJA6CCU UUU9MJSAGA UGCACOMtKS OGCCUCCABA 



3$SQ 3990 4000 4010 4020 

GCCAW>VGG A0CAAUOTG t^CCAAACW AUgUGGAaCU" UCMA&GOKJ CAWCAGiMC 

4030 4<T40 * 4050 4066 40**0 4080 

fflCCCACQaC WWCAACACO GCOBOGOfcfcC QCPSCUSCKje C000CMJSMI ©SCAUUCAOC 

- d0<*0 " 1X00 41X0 4120 4130 4140 

Gceececoca cokwcww QKftmmc accbocjbjoc WCUUMCKO 
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[Figure 3D] 

£150 4150 4170 4180 4190 *2Q0' 

useepGaeGO aschmsmc accftcceooG eececsAcoe cmouoguogo cagugoccoq 

423.0 4220 4230 4240 4250 4360 

mmboeoQ cuswsocag cauagocuoq scsuasagugc ugguccacau ccuggcaggo 

4270. mo ma «ot> «io 4320 
mvwmwG .ecswuceOQ eocecoosuc .goguwaaga ikSaugucogg cgagaagccc 

4330 43*0 .4350 4360 4370 4380 

p(^U6eiO& JOJSUCAUCWt OTSJCCUGCCU OSCAWCUGa CUCCAGSOSC UCW3C**5«JG 

4390 4400 4410 4420 4430 4440 

GGASOqAOCO GOSCGQCCAO OCUGCGCOGC CAUGOGGGAC CGGOGGAAGG CCCX3G0CCAA 

4450 4460 4470 4480 4490 4500 

WGAWAACA GGCOUMJCSC CWCQQXXX AGAGGAAAOC AOGUCGCCCC UACUCACUAC 

4510 4*20 4S30 4540 4550 4560 

GUSACGSAGU OGQAOSCGOO GGftOCGVGOC A<XJC*ACUeC WJGGCUCOCU CftCUAyAACU 

4$7o 4500 4590 4600 4610 4620 

mxohCJch ©asacoucsi carcosjaoc aok*,g3Wai gccccawooc auscgccgcc 

■ 4630 4«>40 4630 4««» ' 4570 4580 

yoGoeectKc gosmsgusug gsacogggoc teoAocAocc vaacwjacoo OMeAAcoss 

4630 4700 4510 4720 4730 4740 

cugaocucca AGoxniyccc Ahao&OGCaj esccoccccu wwoo<xm ccaaaagggq 

4750 47*0 4770 4780 4790 4800 

VACAAGG603 tfOJGGGCCGC CAOXSGCAIJC AUGACCAC&C O&USCCCCU© C<3<3CGCCAAC 

4810 4820 4630 4840 4850 4860 

GGGCUCUADG ASAAUCACSO GAOCCAAAftC C06CAOQAAC 

4870 4800 4890 4900 4910 4920 

ACCUOGCAOS 9SACCUUUCC X)AOCAA0O3U UAVACAWAO CCCAGUGCOU GCCBAAACCC 

4930 4940 4950 49CC 4970 4983 

0C6WAAAOJ UCftAGACOGG CAOOUGQAGA CAGAGDACSC GGMG^iCG 

4990 5000 5010. 5020 . 5030 £040 

CSGCACSGAU CAOAUGCO0A OAXIAACSGGQ COGACCACUS ACAACUUA&A AOUCCCOOGC 

S050 §050 .5070 . 508O 5090 S100 

CAACUCCCCU COOCAGAGBS WUW&JUSO GOGGACSG2S3 UACAAAUCCA UAG^OCSCC 

5110 5120 5130 • 5140 5150 5180 

COCACACCAA AGCOSmWUU CCSeGAUGAO SUCOCG0VCA CSCGUltfffiOCU CAADUCAUUU 

5170 5J80 5190 5200 5210 5220 

GEICGUCGGGO OKaVGCOOCC CiraOQACOCO GAGCCOGACA <3X?A.0QW>GV GmQCCWS 

5230 5240 5250 5260 S270 5280 

C&AftCAGACC CAOCKCAUAU CftCGOOGGAG GCOOCAGCGC ©3CGUWAGC GOGGGGGOCA 

5290 5300 5310 5320 3330 S340 

CXCCCAVtUS MKCAftSKUC eOCfcGC&AGO O&CWUCCG CSCCAU0GCO OCOAOOCftCO 

5350 5360 5370 5380 5390 5400 

IJSC&CCACCe ACGOlttfiOAG CU&UQAUSOS QACAUQGUOS AUGCCAACCU GW3CAU&&G 

5410 5420 5430 5440 5450 3460 

GOSQCBWft UUCWAUAOA GUOJOAOOOC WA6UGCOOG DOOJGGSCUC CC008ACUCA 

5i70 5480 5490 &50O SSiO 

AUSftGOGAGG. CCOlfGASCCy UCAGUACCftU CCW^UAWV GaJdCt'CAGG 
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[Figure 3E] 



5530 5540 5550 SS60 5570 5S8Q 

aasagswcc cwwwscuo accggOkxjg ecocwccoa aoowsmocc acceeuoayo 

S^SO 56Q0 561.0 5620 5*30 5640 

ASAGGCC&GA OUACCAACCA CCCACOCWS Oa'CtCCCCC 

5650 5630 5670 5660 5650 570© 

CCCAAAAAGA CX<X)GA03CC UCCUOCAA&J AGAOSCOaSA C&GOGGQUCO OAGCCfiGAGC 

5710 5730 5730 574.0 5750 5760 

AOCMJAGfiAG AUGCCC00C& ACAOCUGGCC AUCAACSOCCtf WJGGCC&SCC CCCCCCAR0C 

5770 5760 - 5730 POO S810 5B20 

GGCQAUUCAS GCtSWJUCCiC GGGS30GGAC GCOGCCGACO OOSGCtJAVOG CACAOOCCCU 

5^30 5640 5850 5860 5870 ' 5880 

Gkosmxm. cocuwcgga OACAGewpy AcctrseuoeA yscocxmsJ csagcgggag 

5890 5900 3$10 5930 5930 5640 

ccaeoseAcc casaccugba (xkugagcag wmmwc mccuccugc ccaggqsggg 

5550 5960 $£70 S960 5530 6000 

CXXGmxm CUCGGGGBOC UGGUCUACOU GCOCCSAQGA GGAUGACUCC 

6010 6020 6030 C040 6050 6060 

GCJOOOGUGCU GCUCCAUGyC AWJOCC0G3 ACCGGGGCUC UAAUAACUCC tJUGUAGCCCQ 

6070 6080 5090 5100 6U0 $120 

smiswsoam aguibocami uaacuocuoq AGmcocsc u^jwg^rpa osaiaaaaa 

6130 5ld0 6180 6160 6170 6180 

&)Mhvmm ojacaucaaa casogccoca coa«3Gocoa aa&agguaac w&wmoo 

. 6190 5300 6310 6220 6830 ' 6240 

avscaasogc ucoaccccua upAusAuota gucouaaags acaucaagco flscseccgoc 



MG0UCA0O3 CAf^GOTCCU CACeyOAGAS GAGGO3UG0C AAOUOACfcOC ACCCCAClXX? 

6310 6320 6330 6340 6350 6360 

GCAAGAtfCCA- AGUAVGWW DQSGGCOAAG GAGGUCCtiOA GCOWSOraaS GAGG600SOG 

6370 6380 5390 6400 6410 6*20 

AAt'CACAlJCA AGycO6O30S GAAQ3ACC1W WX*5AA<3ACU CACAAACACC AAUUCCUACft. 

6430 6440 6450 6460 6410 6400 

AceaecsosG ccaaaaauga ggogwjokc goggaccccg ccwusggoss oaaaaaacca 

6490 6500 6510 6520 6530 6540 

60J03CCUt>A OC0000AOCC BGXOCUCG0C GtXSftGGSGCe GCGASAASW GGCCCUWAU 

6550 6560 6S70 6580 6590 6600 

SAUOJC&CAC: AAAAGCtKOC UCAG3CGGWJ AWOGQ^JW? OXJMXSGOTJ OTA0UACOCC 

6610 6620 6630 6640 6650 6660 

CCCGWCAGC GGGUGGAGUU UCUCW3AAQ GCAtXSOKGS AAAAGSGfiGA CCCOHUGGGO 

6670 6680 6690 .6700 6710 6720 

UHBUCGUAUQ AUACC03MJ6 CUUDSACUCA AW3UCACWS AGAGAOACAU CAGSACUGAG 

6730 6740 6750 6760- 6770 £780 

GAGUCCMJAU AOCAGOCCPS C0<XmO0C"GA<»AGGCtt: GAftCUdCCM) AGAOXmJS 

6790 fieOO 6*1(1 ' 6820 6830 6fr<J0 

AOlOAGAOftC OC0ACGO3GG AGGGCCCMXJ OUCAACftUCA AGQGOCAGUC CUGGS6GUAC 

6ftSo 5&6G 6670 6380 6900 

AQGCOUOGCe GCfiCCAGCGO SaUGCUUACe ACUACDAMGG ^ASCACCAU CACMJSCDfiU 
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[Figure 3F] 



69*0 $320 &M0 6?4Q $950 

soaaamscoc yAeooGccos casgggpgo? G&mmm oseecftOGMj Gtvmytmc 

£970 $990 7000 7010 7020 

oscGhcms) wmcavm} cucag&aasc casggga<P3 ^sa©sAcsA cc^aacoss 

7030 70 4 G 7050 70*0 7070 7060 

.-sGAGCCUUCft O30AGGCUAU GACCAOGDAU UC0SCOCCUC CIX3GUGACCC CCCCftGACCG 

7090 . 7100 7il0 7120 7130 7S40 

oaaoaogacc osGasajjiKj aacagcotgo yceuCAwiCQ UGOcmyecc acuuggcgca 

7150 7160 7170 7180 7190 7200 

CM3GG00flOC GCASA0ACO& CC^GACCASA OACOCCftCCA COyCAMJOSC OXWGCUGCC 

7310 7220 7330 7240 725Q 7260 

■ oesejAACAQ uu-agacacoc ccaseocAAu ueAooseoee gaaacaocau ccagoaoscu 

7270 73§0 7290 7300 • 7310 7380 

OCSSOatMl GKGDCGCAIX ftWlCACUOCO CAUGGCOGAG 

7330 7340 . 7350 7360 7370 73S0 

SACftCCOJAe ACOWSAACCU UAACUWGAA AUGtlft0$<?At3 CG<5(JGt5ACOC OGlXsASyCCW 

7390 7400 74X0 7480 7430 7440 

CUQOACOJCC CfcSCCRWA&U GGAAAGeOQA CAOGGCCCfOQ AOSOCOOCOC XKXX5CACACA 

7450 7460 7470. 7480 7490 7500 

cacacocccc acqaawjaO $os<5aweaf yessoocucA ©aaacxrxss ggcgccaccc 

7510 7520 7530 7540 7580 7560 

OUCAQMOGO GGAAGASUOG GGCGCG00CA <3WJiK3©KSU CCCOCAOCOC CCGOGSGGeQ 

7S70 7580 7S90 7600 7610 7620 

iusewwGcoe utwecssoas gvaccucwc AACusseGoee BeA&3AceAA gcccaaacvc 

7630 7640 7650 7«60 7670 7680 

AaKXXiTOSC ceSMGCAOT COJCCVGQAU WJSUCCAQUU JSSUUOACCSO O3CCGO930C 

7690 7700 7710 7750 7730 7740 

SGCG0CGACA OSUGUOGcOu GCC03ACCCC GCCOTUTOACO CCUIWJCCUA 

?7S0 , 7760 7770 7780 7790 7000 

CUCOUACUUD cUGtfAGGGGU AOGOGOCOUC COACOCQCCO CW3AUAGAG CCGCACAGflK) 

70S0 7830 7S3G 7840 78S0 7050 

USGOJACftO) CC&UASCUA& CUG'JOCCUUU UWUUUOOW WWJWJDOGO OWUUtfUCBO 

7870 7880 7896 7900 7M0 7920 

tfuuoyysoau cuouuuuouu mraOuceeoe ooucuocccu tfcvcAucwA wcyAcuuac 

7930 79-40 795D 756© 7»7Q 7980 

VUDCyOGGOG GCOCmxm AGOCCOASOC ACSOCWSCO GOGAAAGGOC CeUGAGCGGC 

7990 8000 8010 8020 8030 8040 

AUQACUSG&G ACAGWGCQGtf SACttSGOCOG UCWCftGAUC AOOO 
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[Figure 5| 
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[Figure 7] 
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[Figure 8] 
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[Figure 9] 
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[Figure 10] 
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[Figure 11] 
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[Figure 12] 
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[Figure 13] 
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[Figure 14] 
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[Figure 15] 
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[Title of Document] ABSTRACT 
[Abstract] 

[Technical Problem] An object is to provide a replicon RNA that is derived from 
HCV of a different genotype from genotype lb. 

[Technical Solution] A replicon RNA comprising a nucleotide sequence at least 
containing the 5' untranslated region, the nucleotide sequence encoding NS3 
protein, NS4A protein, NS4B protein, NS5A protein and NS5B protein, and the 3' 
untranslated region on the genomic RNA of hepatitis C virus of genotype 2a is 
provided. 

[Selected Drawing] None 
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